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5 TITLE 

Receptor for Oncostatin M and Leukemia Inhibitory Factor 

10 

BACKGROUND OF THE INVENTION 

Receptors that bind specific molecules (e.g., a hormone, drug, cytokine, or 
biochemical) have been identified on a multitude of cell types. Receptors are found on 

15 the cell surface or, in the case of soluble receptors, are released into the serum. Effort 
has been directed toward isolation and characterization of a number of receptors in 
order to study their physiological roles and to explore possible therapeutic uses. The 
binding of a particular target molecule by a soluble receptor administered to a patient 
may alleviate disorders mediated by the target molecule. 

20 Certain receptors have been found to comprise two separate polypeptide chains 

associated in the form of a complex. Such two-chain receptors often bind the target 
molecule with greater affinity than that exhibited by one of the chains alone. 

Leukemia inhibitory factor (LIF) is a polypeptide hormone that plays a central 
role in the regulation of diverse adult and embryonic systems. LIF acts on a variety of 

25 cell types and has multiple biological activities. The diversity in biological activity is 
reflected in the various synonyms of LIF, which include hepatocyte stimulating factor 
ITT (Baumann and Wong, /. Immunol. 743:1163 [1989]); cholinergic nerve 
differentiation factor (Yamamori et al., Science 246: 1412 [1990]); melanoma-derived 
lipoprotein lipase inhibitor (Mori et al., Biochem. Biophys. Res. Comm. 760:1085 

30 [1989]); human interleukin for DA cells (Moreau et al., Nature 336:690 [1988]); 
differentiation factor (Tomida et al., 7. Biol. Chem. 259:10978 [1984]); differentiation 
inhibitory factor (Abe et al., /. Biol. Chem. 264; 8941 [1989]); differentiation 
inhibitory activity (Smith and Hooper, Devel Biol;. 121:1 [1987]); and differentiation 
retarding factor (Koopman and Cotton, Exp. Cell. Res. 754:233 [1984], 

35 The cloning of a leukemia inhibitory factor receptor (LIF-R) has been reported 

by Gearing et al. in EMBO J. 70:2839 (1991). This recombinant single-chain LDF-R 
polypeptide binds LIF, but with lower affinity than the naturally occurring LIF 
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receptors found on certain normal cells. A receptor that binds LIF with higher affinity 
than that exhibited by this cloned single chain LIF-R would be desirable for certain 
applications. 

Oncostatin M is a secreted single-chain polypeptide cytokine that regulates the 
5 growth of certain tumor-derived and normal cell lines. Oncostatin M is produced by 
activated lymphoid cells. A number of cell types have been found to bind the 
oncostatin M protein. See, for example, Linsley et aL, /. Biol. Chem., 264: 4282 
(1989). However, the isolation and characterization of an oncostatin M receptor have 
not been reported. 

10 

fiynyr^AttV OF THT. TKVF.NTTON 
The present invention provides a receptor that has the property of binding both 
oncostatin M and leukemia inhibitory factor (LIF). The receptor comprises gpl30 
linked (preferably covalently) to leukemia inhibitory factor receptor (LIF-R). The 

15 gpl30 polypeptide may be covalently linked to the LIF-R polypeptide by any suitable 
means, such as via a cross-linking reagent or a polypeptide linker. In one embodiment 
of the invention, the receptor is a fusion protein produced by recombinant DNA 
technology. Disorders mediated by either oncostatin M or LIF may be treated by 
administering a therapeuticaUy effective amount of the inventive receptor to a patient 

20 afflicted with such a disorder. 

rmr pf?^ 1 ^™* oir xhb praisings 

Figure i is a graph presenting the results of an LIF binding assay. Host cells 
transfected with vectors) encoding gp 130 or LIF-R were assayed for the ability to bind 

25 LIF, as described in example 1. 

Figure 2 is a graph presenting the results of an oncostatin M binding assay. 
Host cells transfected with vector(s) encoding gpl30 or LIF-R were assayed for the 
ability to bind oncostatin M, as described in example 2. 

Figure 3 is a graph depicting low affinity binding of oncostatin M to host cells 
30 transfected with a gpl30 encoding expression vector, as described in example 2. 

Figure 4 schematically depicts a receptor of the present invention wherein Fc 
polypeptides derived from an antibody are used to link a gpl30 fragment to an LIF-R 
fragment. 

Figure 5 presents composite DNA and encoded amino acid sequences of a full 
35 length LIF-R, determined by comparing the sequences of cDNA and genomic clones. 
The signal peptidase cleavage site is marked with a vertical arrow. The transmembrane 
region is heavily underlined. Potential N-linked glycosylation sites are marked with 
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asterisks. Hallmark residues associated with the hematopoietic family of receptors are 
shown boxed. The horizontal arrow marks the point at which genomic sequence was 
used to derive the 3 1 coding region of LIF-R, since the cDNA clones employed in 
determining this sequence terminated with a stretch of A nucleotides at this point. 

5 Figure 6 presents the DNA and deduced amino acid sequences of cloned gpl30 

cDNA as reported by Hibi et ah in Cell 63: 1 149 (1990). A predicted signal sequence is 
underlined. The thick underline indicates a presumed transmembrane region. The sets 
of asterisks identify possible N-glycosylation sites. 

Figure 7 presents Scatchard analyses that demonstrate the interaction of a 

10 soluble gpl30/Fc fusion protein with soluble LIF-R/Fc in binding LIF and oncostatin 
M, as described in example 7. 



DETAILED DESCRIPTION OF THE INVENTION 
15 The present invention provides a receptor comprising gpl30 covalently linked 

to leukemia inhibitory factor receptor (LIF-R). In another embodiment of the 
invention, the receptor comprises gpl30 non-covalently complexed with LIF-R. The 
receptor is capable of binding oncostatin M, and also binds leukemia inhibitory factor 
(LIF). The receptor thus is useful for treating disorders mediated by either oncostatin 
20 M or LIF, 

The gpl30 may be covalendy linked to the LIF-R by any suitable means, such 
as via a cross-linking reagent or a polypeptide linker. The gpl30 and LIF-R proteins 
are covalently linked in a manner that does not interfere with the resulting receptor's 
ability to bind oncostatin M and LIF. In one embodiment of the invention, the receptor 

25 is a fusion protein produced by recombinant DNA technology. 

Non-covalent bonding of gpl30 to LIF-R may be achieved by any suitable 
means that does not interfere with the receptor's ability to bind oncostatin M and LIF. 
In one approach, a first compound is attached to LIF-R and a second compound that 
will non-covalently bond to the first compound is attached to gpl30. Examples of 

30 such compounds are biotin and avidin. The receptor is thus formed through the non- 
covalent interactions of biotin with avidin. In one embodiment of the invention, LIF-R 
and gpl30 are recombinant polypeptides, each purified from recombinant cells and then 
non-covalently bonded together to form the receptor. A host cell may be transformed 
with two different expression vectors such that both LIF-R and gp 130 are produced by 

35 the recombinant host cell. LIF-R and gpl30 (one or both of which are soluble 
fragments as described below) produced by such transformed host cells may associate 
to form a complex through non-covalent interactions. 
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"Leukemia inhibitory factor receptor" (LIF-R) refers to a protein (a cytokine 
receptor) that is present on the surface of various hematopoietic cells, including 
monocyte-macrophages and megakaryocytes, and on non-hematopoietic cells, 
including osteoblasts, placental trophoblasts, and liver parenchymal cells. LIF-R is 
5 capable of binding leukemia inhibitory factor (LIF) molecules and plays a role in 
transducing the signal provided by LIF to a cell. In the absence of any species 
designation, LIF-R refers generically to mammalian LIF-R, which includes, but is not 
limited to, human, murine, and bovine LEF-R. 

The cl oning of human and murine leukemia inhibitory factor receptors (LIF-R), 

10 each a single polypeptide chain, has been reported by Gearing et al. in EMBO J. 
10:2839 (1991), which is hereby incorporated by reference in its entirety. The DNA 
sequence of a human LIF-R cDNA clone and the amino acid sequence encoded thereby 
are shown in SEQ ID NO: 5 and SEQ ID NO: 6. This cloned human cDNA encodes an 
N-terminal fragment of human LIF-R that includes On order from N-terminus to C- 

15 terminus) a 44-amino acid signal sequence (amino acids -44 to -1), the entire 
extracellular region, a transmembrane region (the first amino acid of which is amino 
acid number 790 of SEQ ID NO: 5) and a portion of the cytoplasmic domain. The C- 
terminus of the fragment includes amino acids encoded by a poIy-A segment and by a 
linker employed in vector construction, as described in Gearing et aL, supra. The term 

20 "transmembrane region" as used herein refers to a string of hydrophobic amino acids 
positioned between the extracellular domain and the cytoplasmic domain of the protein. 
A plasmid vector containing the -above-described cloned human LTF-R cDNA is 
designated pHLIFR-65 and has been deposited in E. coli host cells with the American 
Type Culture Collection on December 11, 1990 (ATCC accession no. 68491). The 

25 DNA and amino acid sequences of a full length native human LIF-R (determined by 
comparing the sequences of cDNA and genomic clones) have been reported by Gearing 
et aL supra and are presented herein in Figure 5. 

The LEF-R encoded by the cloned cDNA (SEQ ID NO: 6) contains the entire 
extracellular region of LIF-R (the domain believed to be responsible for the UF- 

30 binding activity), and binds LIF, but with lower affinity than does a naturally occurring 
LIF receptor found on certain normal cells. Additionally, oncostatin M competes with 
T .TP for binding to naturally occurring high af finit y LTF receptors on certain cell types 
(Gearing et al., New Biologist, 4:61, 1992) but did not bind to the above-described 
cloned LIF-R expressed in COS cells. 

35 jji order to investigate the possible existance of a high affinity converting 

subunit for the cloned single polypeptide chain LIF-R, host cells were co-transfected 
with the LTF-R encoding plasmid pHLTFR-65 and with pools from a human placental 
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cDNA library (also contained in an expression vector). The co-transfected cells were 
assayed for the ability to bind radiolabeled oncostatin M. 

A positive cDNA pool was subdivided and the procedure repeated to isolate a 
single cDNA clone designated B10G that conferred the ability to bind oncostatin M on 
5 cells co-transfected with B10G and the LIF-R encoding plasmid pHLEFR-65. The co- 
transfected cells also were found to bind LIF with higher affinity than cells transfected 
with pHLDFR-65 alone. Host cells transfected with B10G alone exhibited low affinity 
oncostatin M binding sites. The B10G cloned cDNA was sequenced and found to 
encode a protein that is known as gpl30. 

10 Thus, it has now been found that a receptor comprising both LIF-R and gpl30 

binds LIF with higher affinity than does the single-chain LIF-R polypeptide alone. The 
improved UF binding of LEF-R in combination with gpl30 is described in example 1 
below and depicted in Figure 1. 

Although LIF does not bind to either high- or low-affinity oncostatin M 

15 receptors, it has now been found that oncostatin M binds to the receptors of the present 
invention comprising LIF-R and gpl30. Oncostatin M binding is described in example 
2 below and depicted in Figure 2. 

A protein known as gpl30 has been purified from cellular sources that include 
placental tissue and a myeloma cell line U266. A number of additional cell types have 

20 been found to express gpl30 mRNA, as reported by Hibi et al., in Cell 6?: 1149 
(1990). gpl30 has been reported to be involved in the formation of high affinity 
interleukin-6 binding sites and in IL-6 signal transduction (Hibi et al. supra). The 
cloning and expression of cDNA encoding a full length gpl30 protein has been 
reported by Hibi et al., supra, which is hereby incorporated by reference in its entirety. 

25 The DNA and deduced amino acid sequences reported by Hibi et al. for the gpl30 
cloned cDNA are presented herein in figure 6. The gpl30 amino acid sequence may 
vary from that reported by Hibi et al., e.g., leucine may be substituted for valine at 
position 8 in the signal sequence (numbering is as shown in Figure 6). This amino acid 
substitution may be attributable to genetic polymorphism (allelic variation among 

30 individuals producing the protein), and results from the presence of C rather than G at 
nucleotide position 22. 

As used herein, the term LEF-R includes variants and truncated forms of native 
LIF-R proteins that possess the desired LIF-binding or signal transducing activity. 
Likewise, the term gpl30 as used herein includes variants and truncated forms of the 

35 native gpl30 protein that retain the desired biological activity. For gpl30, the desired 
biological activity includes binding of oncostatin M; conferring on the inventive 
receptor the ability to bind oncostatin M; and increasing the affinity of the inventive 
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receptor for UF, compared to the LIF binding affinity of the single-chain LIF-R 
polypeptide alone. Specifically included are truncated, soluble or fusion forms of I2F- 
R and gpl30, as described below. Variants produced by adding, substituting, or 
deleting amino acid(s) in the native sequence are discussed in more detail below. 

5 One example of an LIF-R polypeptide that may be employed is that encoded by 

me cDNA clone designated pHLIF-R-65 (SEQ ID NO: 5), as described by Gearing et 
aL, supra and in example 3 below. Alternatively, a fragment comprising amino acids 1 
to 945 of SEQ ID NO:5 may be employed. Amino acid 945 is the last UF-R-specific 
amino acid of the polypeptide encoded by clone pHLIF-R-65, before the poly-A 

10 nucleotide segment believed to result from oligo(dT) priming at an internal site in the 
mRNA during preparation of the bXJF-R cDNA. (See Gearing et aL, EMBOJ., supra. 

at page 2840, column one.) 

Other examples of LEF-R polypeptides that may be employed in the inventive 
receptors include those lacking all or part of the transmembrane region or the 

15 cytoplasmic domain of the protein. Suitable LIF-R polypeptides thus include those 
con taining amino acids 1-x or, when the signal sequence is not desired, amino acids 
45-x of the full length LIF-R sequence depicted in Figure 5, wherein x represents an 
integer from 833 to 1096. Amino acid number 833 is the last amino acid of the 
extracellular domain (Le., before the start of the transmembrane region.) Polypeptides 

20 terminating in amino acid number 1096 lack the IastC-terminal amino acid of the full 
length protein. The desirability of including the signal sequence depends on such 
factors as the position of LIF-R in a fusion protein, as discussed below, and the 
intended host cells when the receptor is to be produced via recombinant DNA 
technology. Note that the numbering of amino acids in Figure 5 (taken from Gearing et 

25 aL, supra) differs from that of SEQ ID NO: 5 because the first amino acid of the signal 
sequence is designated amino acid number 1 in Figure 5 but is designated -44 in SEQ 
ID NO: 5. Other polypeptides may be chosen with regard to sequences that are 
conserved in the hematopoietin receptor family, (Le., chosen to include the boxed 
sequence(s) shown in Figure 5.) 

30 One example of a suitable gpI30 polypeptide is that encoded by cDNA cloned 

into plasmid vector pDC303 to produce a plasmid designated B10G. The source of 
mRNA used in producing the cDNA was human placental tissue. Plasmid BlOGinE. 
coli strain DH5a host cells was deposited with the American Type Culture Collection, 
Rockvflle, Maryland, on November 14, 1991, and assigned ATCC accession number 

35 68827. 

The DNA sequence of the gpl30 cDNA contained in plasmid B10G and the 
amino acid sequence of the gpl30 protein encoded by the cloned cDNA are presented in 
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SEQ ID NO; 1 and SEQ ID NO: 2. The protein comprises (in order from the N- 
terminus to the C-terminus) a 22-amino acid signal sequence, complete extracellular 
domain (amino acids 1-597), a transmembrane region (beginning with amino acid 598), 
and a partial cytoplasmic domain (amino acids 621-686). This truncated gpl30 
5 polypeptide differs from the equivalent portion of the Hibi et al. protein in that the 
eighth amino acid of the signal sequence is leucine rather than valine, as discussed 
above. 

Another example of a suitable gpl30 polypeptide comprises amino acids 1 to 
496 of the SEQ ID NO: 1, which includes all of the cysteine residues found in the 

10 extracellular domain of the protein, and also contains a complete fibronectin domain. 
Additional examples of gpl30 polypeptides are those comprising amino acids 1-298 or 
98-298 of SEQ ID NO: 1. 

Other gpl30 polypeptides lacking all or part of the transmembrane region and/or 
cytoplasmic domain may be employed. Suitable gpl30 polypeptides thus include those 

15 containing amino acids 1-x or, when the signal sequence is not desired, amino acids 
23-x of the Figure 6 sequence, wherein x represents an integer from 619 to 917. The 
first amino acid of the transmembrane region is the alanine residue at position 620 in 
Figure 6. Polypeptides terminating at amino acid 917 lack the last C-terminal amino 
acid of the full length protein presented in Figure 6. Note that the numbering of amino 

20 acids in Figure 6 (taken from Hibi et al., supra) differs from that shown in SEQ ID NO: 
1 and NO:2 because the first amino acid of the signal sequence is designated amino acid 
number 1 in Figure 6 but is designated -22 in SEQ ID NO: 1. Regions of the gp 130 
protein corresponding to domains that are conserved among certain receptors are 
discussed by Hibi et al, supra, at page 1150, column 2, and page 1151, column 1. 

25 Other truncated gpl 30 polypeptides chosen to include these conserved regions may be 
employed. 

Preferred UF-R and gpl 30 polypeptides are those which are soluble. In one 
embodiment of the present invention, the receptor comprises soluble LIF-R covalently 
attached to soluble gpl30. "Soluble LEF-R" as used in the context of the present 

30 invention refers to polypeptides that are substantially similar in amino acid sequence to 
all or part of the extracellular region of a native LIF-R and that, due to the lack of a 
transmembrane region that would cause retention of the polypeptide on a cell 
membrane, are secreted upon expression. The soluble LIF-R polypeptides that may be 
employed retain the ability to bind LIF or, by competitively binding LIF, inhibit LIF 

35 signal transduction activity via cell surface bound LIF-R proteins. Soluble LIF-R may 
also include part of the transmembrane region or part of the cytoplasmic domain or 
other sequences, provided that the soluble LIF-R protein is capable of being secreted. 
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Likewise, the term "soluble gpl30 M as used herein refers to proteins that are 
substantially similar in amino acid sequence to all or part of the extracellular region of a 
native gpl30 and are secreted upon expression but retain the desired biological activity. 
Soluble gpl30 may include part of the transmembrane region, cytoplasmic domain, or 

5 other sequences, as long as the polypeptide is secreted. 

Soluble LIF-R and soluble gpl30 may be identified (and distinguished from 
their non-soluble membrane-bound counterparts) by separating intact cells which 
express the desired protein from the culture medium, e.g., by centrifugation, and 
assaying the medium (supernatant) for the presence of the desired protein. The culture 

10 medium may be assayed using procedures which are similar or identical to those 
described in the examples below. The presence of LIF-R or gpl30 in the medium 
indicates that the protein was secreted from the cells and thus is a soluble form of the 
desired protein. Soluble LIF-R and soluble gpl30 may be naturaUy-occurring forms of 
these proteins. Cloning of a naturaUy-occurring soluble murine LIF-R is reported in 

15 Gearing et aL, supra. Alternatively, soluble fragments of LIF-R and gpl30 proteins 
may be produced by recombinant DNA technology or otherwise isolated, as described 
below. 

The use of soluble forms of LIF-R and gpl30 is advantageous for certain 
applications. Purification of the proteins from recombinant host cells is facilitated, 

20 since the soluble proteins are secreted from the cells. Further, soluble proteins are 
generally-more suitable for intravenous adrninisuation and may exert their therapeutic 
effect (binding LIF and oncostatin M) in the bloodstream. 

Soluble LIF-R polypeptides include those comprising the signal sequence and 
entire extracellular domain (arrrino acids -44 to 789 of SEQ ID NO: 5) or lacking the 

25 signal sequence but containing the entire extracellular domain (amino acids 1 to 789 of 
SEQ ID NO: 5). Soluble gpl30 polypeptides include those comprising the signal 
sequence and entire extracellular domain (amino acids -22 to 597 of SEQ ID NO: 1) or 
lacking the signal sequence but containing the entire extracellular domain (amino acids 1 
to 597 of SEQ ID NO: 1). The preparation and use of these soluble polypeptides in 

30 receptors of the present invention is described in examples 3-5. 

Other soluble LIF-Rs are truncated upstream of the transmembrane region, but 
preferably include that portion of the protein that contains the residues conserved 
among the members of the hematopoietin receptor family (shown boxed in Figure 5), 
i.e., amino acids 11-479 of SEQ ID NO:6. The N-terminus of such soluble LIF-Rs is 

35 any of arnino acids 1-1 1 (or -44 when the native signal sequence is included), and the 
protein extends to a C-terminus selected from any of amino acids 479 through 789. 
Two such soluble proteins comprise amino acids -44 - 702, 1 - 702, -44 - 775, or 1 - 
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755 of SEQ ID NO:6. Constructs encoding these proteins may be prepared by 
techniques that involve cleaving the human LIF-R cDNA of the above-described clone 
pHLIFR-65 (ATCC 68491) with the restriction endonucleases Asp718 andXmnl or 
with Asp718 and Bspl286I. Asp 718 cleaves the vector upstream of the inserted LDF- 
5 R-encoding cDNA. Xmnl cleaves within the codon for Asp at position 702 
(generating blunt ends) and Bspl286I cleaves just 3* of the codon for Val at position 
775 of SEQ ID NO:5. If desired, an oligonucleotide may be ligated to the 3' end of the 
Asp718IBsp 12861 fragment to extend the LIF-R sequence, e.g., through amino acid 
number 789. An oligonucleotide also may be ligated to the 3' end of a LIF-R fragment 
10 to add the first two amino acids of the Fc polypeptide described in example 5, and a Bgl 
II site useful for attaching the rest of the Fc sequence downstream of the LIF-R 
sequence. 

Additional soluble human LIF-Rs comprise amino acids 1-678 or 1-680 of SEQ 
ID NO:6. When the human and murine LIF-R amino acid sequences disclosed in 

15 Gearing et aL, EMBO supra, are aligned (with gaps introduced to maximize identity 
between the two sequences), amino acid 680 of the human sequence is aligned with the 
last amino acid of the murine protein, and amino acid 678 is the last amino acid of the 
human sequence that is identical to a corresponding amino acid in the murine sequence. 
Since the murine protein binds LIF, the murine LIF-R contains that portion of the 

20 protein required for LIF binding. 

An additional example of a soluble gpl30 polypeptide comprises amino acids - 
22 to 582 of SEQ ID NO:2. An expression vector encoding such a protein was 
constructed in example 7. Soluble LIF-R and gpl30 polypeptides also include those 
from which fibronectin type HI (FNIII) domains have been deleted. From one to all of 

25 the FNIII domains may be deleted, providing the advantage of reducing the size of the 
protein. Preparation of such LIF-R and gpl30 proteins is described in example 8. 

Truncated LIF-R and gpl30, including soluble polypeptides, may be prepared 
by any of a number of conventional techniques. In the case of recombinant proteins, a 
DNA fragment encoding a desired fragment may be subcloned into an expression 

30 vector. Alternatively, a desired DNA sequence may be chemically synthesized using 
known techniques. DNA fragments also may be produced by restriction endonuclease 
digestion of a full length cloned DNA sequence, and isolated by electrophoresis on 
agarose gels. Linkers containing restriction endonuclease cleavage site(s) may be 
employed to insert the desired DNA fragment into an expression vector, or the fragment 

35 may be digested at cleavage sites naturally present therein. Alternatively, proteins may 
be fragmented using proteolytic enzymes, for example, and the desired truncated 
polypeptide isolated from the digestion mixture using reversed phase HPLC. 
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The well known polymerase chain reaction procedure also may be employed to 
isolate a DNA sequence encoding a desired protein fragment This technique is 

illustrated in examples 3-5 below. 

In another approach, enzymatic treatment (e.g., using Bal 31 exonuclease) may 

5 be employed to delete terminal nucleotides from a DNA fragment to obtain a fragment 
having a particular desired terminus. Among the commercially available linkers are 
those that can be Iigated to the blunt ends produced by Bal 31 digestion, and which 
containrestrictionendonncleasecleavagesite(s). Alternatively, oligonucleotides that 
reconstruct the N- or C- terminus of a DNA fragment to a desired point may be 

10 synthesized. The oligonucleotide may contain a restriction endonuclease cleavage site 
upstream of the desired coding sequence and position an initiation codon (ATG) at the 
N-terminus of the coding sequence. 

The gpl30 polypeptide is attached to the LIF-R polypeptide through a covalent 
or non-covalent linkage. Covalent attachment is preferred for certain applications, e.g. 

15 in vivo use, in view of the enhanced stability generally conferred by covalent, as 
opposed to non-covalent, bonds. In constructing the receptor of the present invention, 
covalent linkage may be accomplished via cross-linking reagents, polypeptide linkers, 

or any other suitable technique. 

Numerous reagents useful for cross-linking one protein molecule to another are 

20 known. Heterobifunctional and homobifunctional linkers are available for this purpose 
from Pierce Chemical Company, Rockford, Illinois, for example. Such linkers contain 
two functional groups (e.g., esters and/or maleimides) that will react with certain 
functional groups on amino acid side chains, thus linking one polypeptide to another. 
The reagent and reaction conditions should be chosen such that the cross-linking does 

25 not interfere with binding of oncostatin M andlJF to the receptor. 

One type of polypeptide linker that may be employed in the present invention 
separates gpl30 and UF-R domains by a distance sufficient to ensure that each domain 
properly folds into the secondary and tertiary structures necessary for the desired 
biological activity. The linker also should allow the extracellular domains of gpl30 and 

30 UF-R to assume the proper spatial orientation to form the binding site for oncostatin M 
and LIF. Suitable polypeptide linkers preferably (1) will adopt a flexible extended 
conformation, (2) will not exhibit a propensity for developing an ordered secondary 
structure which could interact with the functional gpl30 and LIF-R domains, and (3) 
w31 have minimal hydrophobic or charged character which could promote interaction 

35 with the functional protein domains. Typical surface amino acids in flexible protein 
regions include Gly.Asn and Ser. Virtually any permutation of amino acid sequences 
containing Gly, Asn and Ser would be expected to satisfy the above criteria for a 
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peptide linker sequence. Other near neutral amino acids, such as Thr and Ala, may also 
be used in the linker sequence. Examples of such polypeptide linkers are presented 
below. 

Another type of polypeptide linker that may be employed comprises the Fc 
5 region of an antibody. An Fc polypeptide is attached to the C-terminus of LIF-R or of 
the LIF-R fragment A separate Fc polypeptide is attached to the C-terminus of gpl30 
or of the gpl30 fragment The two resulting polypeptide chains may be combined in a 
buffered solution, whereupon disulfide bonds form between the two Fc polypeptides 
(e.g., in the so-called hinge region, where interchain disulfide bonds are normally 

10 present in antibody molecules). Preferably, a host cell is transformed with DNA 
encoding both polypeptides such that the two polypeptides are co-expressed and 
interchain disulfide bonds form in the host cell. LIF-R is thus covalently linked to 
gpl30 via the disulfide bonds in the linker portion of the receptor. Procedures for 
isolating the Fc region of an antibody are well-known and include proteolytic digestion 

15 with papain. Alternatively, an Fc polypeptide may be produced by recombinant cells or 
chemically synthesized. Also useful are N-terminal fragments of an antibody Fc region 
that contain the cysteine residues involved in disulfide bond formation at the hinge 
region. One example of a receptor containing an Fc polypeptide linker is illustrated in 
example 5 below. The receptor is depicted in Figure 4. The number and position of 

20 disulfide bonds may vary from those shown in Figure 4. 

Additional examples of LIF-R/Fc and gpl30/Fc fusion proteins useful in 
preparing receptors of the present invention are described in examples 7 and 8. 
Advantageously, host cells are co-transfected with two different expression vectors, 
one encoding soluble LEF-R/Fc and the other encoding soluble gpl30/Fc. The 

25 heterodimer is believed to form intracellularly or during secretion. 

Homodimers comprising two LIF-R/Fc polypeptides or two gpl30/Fc 
polypeptides linked via disulfide bonds are also produced by certain of the transfected 
host cells disclosed herein. The LIF-R/Fc homodimers are useful for binding LDF and 
the gpl30/Fc homodimers find use in binding oncostatin M. The homodimers may be 

30 separated from each other and from the heterodimer by virtue of differences in size 
(e.g., by gel electrophoresis). The heterodimer also may be purified by sequential 
immunoaffinity chromatography (described below). 

In an alternative embodiment, a first fusion polypeptide comprising gpl30 (or 
fragment thereof) upstream of an antibody light chain (or a fragment thereof) is 

35 prepared. A second second fusion polypeptide comprises LIF-R upstream of an 
antibody heavy chain (or a heavy chain fragment, the N-terminus of which extends at 
least through the ChI region. Disulfide bond(s) form between the gpl30-light chain 
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fusion polypeptide and the LIF-R-heavy chain fusion polypeptide, thus producing a 
receptor of the present invention comprising a polypeptide linker- If desired, a.third 
fusion (an LIF-R-antibody light chain fusion polypeptide) is prepared and combined 
with (disulfide bonded to) a fourth fusion comprising gpl30 fused to an antibody 

5 heavy chain. When the two disulfide bonded molecules are combined, additional 
disulfide bonds form between the two Fc regions. The resulting receptor of the present 
mention comprising the four fusion polypeptides resembles an antibody in structure 
and displays the oncostatin M/LJF binding site bivalenuy. 

A polypeptide linker may be attached to gpl30 and to IJF-R by any of the 

10 conventional procedures used to attach one polypeptide to another. The cross-linking 
reagents available from Pierce Chemical Company as described above are among those 
mat may be employed. Amino acids having side chains reactive with such reagents 
may be included in the polypeptide linker, e.g., at the termini thereof. 

The gpl30 and IJF-R polypeptides may be separately purified from cellular 

15 sources, and then linked together. Alternatively, the receptor of the present invention 
may be produced using recombinant DNA technology. The gpl30 and LIF-R 
polypeptides may be produced separately and purified from transformed host cells for 
subsequent covalent linkage. In one embodiment of the present invention, a host cell is 
transformed/transfected with foreign DNA that encodes gpl30 and LIF-R as separate 

20 polypeptides. The two polypeptides may be encoded by the same expression vector 
with start and stop codons for each of the two genes, or the recombinant cells may be 
co-transfected with two separate expression vectors. In another embodiment, the 
receptor is produced as a fusion protein in recombinant cells. 

In one embodiment of the present invention, the receptor protein is a 

25 recombinant fusion protein of the formula: 

Rl-L-R2or R2-L-R1 

wherein Ri represents gpl30 or a gpl30 fragment; R 2 represents IJF-R or an IJF-R 
fragment; and L represents a polypeptide linker. 

30 The fusion proteins of the present invention include constructs in which the C- 

terminal portion of gpl30 is fused to the linker which is fused to the N-terminal portion 
of IJF-R, and also constructs in which the C-terrninal portion of LIF-R is fused to the 
linker which is fused to the N-terrninal portion of gp 130. gpl30 is covalently linked to 
IJF-R in such a manner as to produce a single protein which retains the desired 

35 biological activities of gpl30 and LIF-R. The components of the fusion protein are 
listed in their order of occurrence (i.e., the N-terminal polypeptide is listed first, 
followed by the linker and then the C-terminal polypeptide). 
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A DNA sequence encoding a fusion protein is constructed using recombinant 
DNA techniques to insert separate DNA fragments encoding gpl30 and LIF-R into an 
appropriate expression vector. The 3' end of a DNA fragment encoding gpl30 is 
ligated (via the linker) to the 5' end of the DNA fragment encoding LIF-R with the 
5 reading frames of the sequences in phase to permit translation of the mRNA into a 
single biologically active fusion protein. Alternatively, the 3' end of a DNA fragment 
encoding LIF-R may be ligated (via the linker) to the 5* end of the DNA fragment 
encoding gpl30, with the reading frames of the sequences in phase to permit translation 
of the mRNA into a single biologically active fusion protein. A DNA sequence 

10 encoding an N-tenninal signal sequence may be retained on the DNA sequence 
encoding the N-terminal polypeptide, while stop codons, which would prevent read- 
through to the second (C-terminal) DNA sequence, are eliminated. Conversely, a stop 
codon required to end translation is retained on the second DNA sequence. DNA 
encoding a signal sequence is preferably removed from the DNA sequence encoding the 

15 C-terminal polypeptide. 

Suitable polypeptide linkers comprise a chain of amino acids, preferably from 
20 to 100 amino acids in length and most preferably from 30 to 60 amino acids in 
length. As discussed above, the linker advantageously comprises amino acids selected 
from the group consisting of glycine, asparagine, serine, threonine, and alanine. 

20 Examples of suitable polypeptide linkers include, but are not limited to, (Gly4Ser) n , 
wherein n is 4-12, preferably 8, and (Gly4SerGlysSer)2. 

A DNA sequence encoding a desired polypeptide linker may be inserted 
between, and in the same reading frame as, the DNA sequences encoding gpl30 and 
LIF-R using any suitable conventional technique. For example, a chemically 

25 synthesized oligonucleotide encoding the linker and containing appropriate restriction 
endonuclease cleavage sites may be ligated between the sequences encoding gpl30 and 
LIF-R. 

Alternatively, a chemically synthesized DNA sequence may contain a sequence 
complementary to the 3* terminus (without the stop codon) of either gpl30 or LIF-R, 

30 followed by a linker-encoding sequence which is followed by a sequence 
complementary to the 5' terminus of the other of gpl30 and LIF-R. Oligonucleotide 
directed mutagenesis is then employed to insert the linker-encoding sequence into a 
vector containing a direct fusion of gpl30 and LIF-R. 

The present invention provides an isolated DNA sequence encoding the above- 

35 described fusion protein comprising gpl30, LIF-R, and a polypeptide linker, and also 
provides recombinant expression vectors containing the isolated DNA sequence. 
•■Expression vector" refers to a replicable DNA construct used to express DNA which 
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encodes the desired protein and which includes a transcriptional unit comprising an 
assembly of (1) genetic elements) having a regulatory role in gene expression, for 
example, promoters, operators, or enhancers, operatively linked to (2) a DNA sequence 
encoding a desired protein (in this case, a receptor of the present invention) which is 
5 transcribed into mRNA and translated into protein, and (3) appropriate transcription and 
translation initiation and termination sequences. The choice of promoter and other 
regulatory elements generally varies according to the intended host celL 

Proteins to be produced in yeast expression systems preferably include a leader 
sequence enabling extracellular secretion of translated protein by the yeast host cell. 
10 Alternatively, where recombinant protein is expressed without a leader or transport 
sequence, it may include an N-tenrrinal methionine residue. This residue opuonally 
may be subsequently cleaved from the expressed recombinant protein to provide a final 

product. . . 

In the expression vectors, regulatory elements controlling transcription or 

15 translation are generally derived from mammalian, microbial, viral or insect genes. The 
ability to replicate in a host, usually conferred by an origin of replication, and a 
selection gene to facilitate recognition of transformants may additionally be 
incorporated. Vectors derived from retroviruses also may be employed. 

DNA regions are operably linked when they are functionally related to each 

20 other. For example, DNA encoding a signal peptide (secretory leader) is operably 
linked to DNA for a polypeptide if the polypeptide is expressed as a precursor that is 
secreted through the host cell membrane; a promoter is operably linked to a coding 
sequence if it controls the transcription of the sequence; or a ribosome binding site is 
operably linked to a coding sequence if it is positioned so as to permit translation. 

25 Generally, "operably linked" means contiguous and, in the case of secretory leaders, 
contiguous and in reading frame. 

Transformed host cells are cells which have been transformed or transfected 
with foreign DNA using recombinant DNA techniques. In the context of the present 
invention, the foreign DNA includes a sequence encoding the inventive receptor. Host 

30 cells may be trarisformed for purposes of clonmg or arnplifymg me foreign DNA, or 
may be transformed with an expression vector for production of the receptor protein. 
Suitable host cells for expression of the receptor include prokaryotes, yeast or higher 
eukaryotic cells under the control of appropriate promoters. Prokaryotes include gram 
negative or gram positive organisms, for example E. coli or bacilli. Prokaryotic 

35 expression vectors generally comprise one or more phenotypic selectable markers, for 
example a gene encoding proteins conferring antibiotic resistance or supplying an 
autotrophic requirement, and an origin of replication recognized by the host to ensure 
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amplification within the host. Examples of suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium, and various 
species within the genera Pseudomonas, Streptomyces, and Staphylococcus, although 
others may also be employed as a matter of choice. Higher eukaryotic cells include 
5 established cell lines of mammalian origin. Cell-free translation systems could also be 
employed to produce fusion protein using RNAs derived from the DNA constructs of 
the present invention. Appropriate cloning and expression vectors for use with 
bacterial, fungal, yeast, and mammalian cellular hosts are described by Pouwels et al. 
{Cloning Vectors: A Laboratory Manual, Elsevier, New York, 1985), the relevant 
10 disclosure of which is hereby incorporated by reference. 

Useful expression vectors for bacterial use can comprise a selectable marker and 
bacterial origin of replication derived from commercially available plasmids comprising 
genetic elements of the well-known cloning vector pBR322 (ATCC 37017). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, 
15 Uppsala, Sweden) and pGEMl (Promega Biotec, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the 
structural sequence to be expressed. E. coli is typically transformed using derivatives 
of pBR322, a plasmid derived from an E. coli species (Bolivar et al., Gene 2:95, 
1977). pBR.322 contains genes for ampicillin and tetracycline resistance and this 
20 provides simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include 
the b-lactamase (penicillinase) and lactose promoter system (Chang et al., Nature 
275:615, 1978; and Goeddel et al., Nature 281:544, 1979), the tryptophan (trp) 
promoter system (Goeddel et al., Nucl. Acids Res. 5:4057, 1980; and EPA 36,776) 
25 and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, p. 412, 1982). A particularly useful bacterial expression system 
employs the phage X Pl promoter and cI857ts thermoinducible repressor. Plasmid 
vectors available from the American Type Culture Collection which incorporate 
derivatives of the X PL promoter include plasmid pHUB2, resident in £. coli strain 
30 JMB9 (ATCC 37092) and pPLc28, resident in E. coli RR1 (ATCC 53082). 

The recombinant receptor protein may also be expressed in yeast hosts, 
preferably from Saccharomyces species, such as S. cerevisiae. Yeast of other genera 
such as Pichia or Kluyveromyces may also be employed. Yeast vectors will generally 
contain an origin of replication from the 2|xm yeast plasmid or an autonomously 
35 replicating sequence (ARS), a promoter, DNA encoding the receptor fusion protein, 
sequences for polyadenylation and transcription termination and a selection gene. 
Preferably, yeast vectors will include an origin of replication and selectable markers 
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permitting transformation of both yeast and E. coli, e.g., the ampicillin resistance gene 
of E coli and the S. cerevisiae trpl gene, which provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in tryptophan, and a promoter derived 
from a highly expressed yeast gene to induce transcription of a structural sequence 
5 downstream. The presence of the trpl lesion in the yeast host cell genome then 
provides an effective environment for detecting transformation by growth in the 

absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for 
metallothionein, 3-phosphogIycerate kinase (ffitzeman et aL. J. Biol. Chem. 255:2073, 

10 1980) or other glycolytic enzymes (Hess et aL, /. Adv. Enzyme Reg. 7:U9, 1968; and 
HollandetaL,5wctem-/7^900, 1978), such as enolase, glyceraldehyde-3-phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase and glucokinase. Suitable vectors and 

15 promoters for use in yeast expression are further described in R. Hitzeman et al., EPA 

73,657. c _ 

Preferred yeast vectors can be assembled using DNA sequences from pBR322 
for selection and replication in E. coli (Amp* gene and origin of replication) and yeast 
DNA sequences including a glucose-xepressible ADH2 promoter and a-factor secretion 

20 leader The ADH2 promoter has been described by Russell et al. (/. Biol. Chem. 
25S2674, 1982) and Beier et al., {Nature 300-H2A, 1982). The yeast a-factor leader, 
which directs secretion of heterologous proteins, can be inserted between the promoter 
and the structural gene to be expressed. See, eg. Kurjan et aL. Cell 30:922, 1982; 
and Bitter et aL, Proc. Natl. Acad. Sci. USA 81 :5330, 1984. The leader sequence may 

25 be modified to contain, near its 3' end, one or more useful restriction sites to facilitate 
fusion of the leader sequence to foreign genes. 

Suitable yeast transformation protocols are known to those of skill in the art. 
An exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 
75-1929, (1978), selecting for Trp + transformants in a selective medium consisting of 

30 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, 10 ugAnl adenine and 
20 fig/ml uracil. 

Host strains transformed by vectors comprising the ADH2 promoter may be 
grown for expression in a rich medium consisting of 1% yeast extract, 2% peptone, 
and 1% glucose supplemented with 80 ug/ml adenine and 80 ug/ml uracil. 
35 Derepression of the ADH2 promoter occurs upon exhaustion of medium glucose. 
Crude yeast supernatants are harvested by filtration and held at 4'C prior to further 
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Various mammalian or insect cell culture systems can be employed to express 
recombinant protein. Baculovirus systems for production of heterologous proteins in 
insect cells are reviewed by Luckow and Summers, Bio/Technology 6:47 (1988). 
Examples of suitable mammalian host cell lines include the COS-7 lines of monkey 
5 kidney cells, described by Gluzman (Cell 23: 175, 198 1), and other cell lines capable of 
expressing an appropriate vector including, for examples, L cells, CI 27, 3T3, Chinese 
hamster ovary (CHO), HeLa and BHK cell lines. Mammalian expression vectors may 
comprise non-transcribed elements such as an origin of replication, a suitable promoter 
and enhancer linked to the gene to be expressed, and other 5' or 3 f flanking 
10 nontranscribed sequences, and 5* or 3' nontranslated sequences, such as necessary 
ribosome binding sites, a poly-adenylation site, splice donor and acceptor sites, and 
transcriptional termination sequences. 

The transcriptional and translational control sequences in expression vectors to 
be used in transforming vertebrate cells may be provided by viral sources. For 
15 example, commonly used promoters and enhancers are derived from Polyoma, 
Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences 
derived from the SV40 viral genome, for example, SV40 origin, early and late 
promoter, enhancer, splice, and polyadenylation sites may be used to provide the other 
genetic elements required for expression of a heterologous DNA sequence. The early 
20 and late promoters are particularly useful because both are obtained easily from the 
virus as a fragment which also contains the SV40 viral origin or replication (Fiers et al., 
Nature 273:113, 1978). Smaller or larger SV40 fragments may also be used, provided 
the approximately 250 bp sequence extending from the Hind m site toward the BgH 
site located in the viral origin of replication is included. Exemplary vectors can be 
25 constructed as disclosed by Okayama and Berg (Mol. Cell. Biol 5:280, 1983). 

A useful system for stable high level expression of mammalian receptor cDNAs 
in C127 murine mammary epithelial cells can be constructed substantially as described 
by Cosman et al. (Mol. Immunol. 23:935, 1986). 

Particularly preferred vectors for expression of the inventive receptor as a 
30 fusion protein are described in the examples below. The foregoing discussion is, of 
course, applicable to the production of recombinant fusion proteins comprising a 
fragment of gpl30 and/or a fragment of LIF-R. Suitable fragments are discussed 
above, and DNA sequences encoding such fragments may be inserted into the above- 
described expression vectors. 
35 The present invention provides a process for preparing the recombinant receptor 

of the present invention, comprising culturing a host cell transformed with an 
expression vector comprising a DNA sequence that encodes said receptor under 
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conditions that promote expression. The receptor is then purified from culture media or 
cell extracts. 

For example, supernatants from systems which secrete recombinant protein into 
the culture medium can be first concentrated using a commercially available protein 
5 concentration filter, for example, an Amicon or MiUipore Pellicon ultrafiltration unit. 
Following the concentration step, the concentrate can be applied to a suitable 
purification matrix. For example, a suitable affinity matrix can comprise LIF or OSM. 
An UF affinity matrix may be prepared by coupling recombinant human UF to 
cyanogen bromide-activated Sepharose (Pharmacia) or Hydrazide Affigel (Biorad), 
10 according to manufacturer's recommendations. Sequential immunopurification using 
antibodies bound to a suitable support is preferred. Proteins binding to an antibody 
specific forUF-R are recovered and contacted with antibody specific for gpl30 on an 
insoluble support. Proteins irnmunoreactive with both antibodies may thus be 
identified and isolated. Alternatively, an anion exchange resin can be employed, for 
15 example, a matrix or substrate having pendant diethylaminoethyl (DEAE) groups. The 
matrices can be acrylamide, agarose, dextran, cellulose or other types commonly 
employed in protein purification. Alternatively, a cation exchange step can be 
employed. Suitable cation exchangers include various insoluble matrices comprising 
sulfopropyl or carboxymethyl groups. Sulfopropyl groups are preferred. One or more 
20 reversed-phase high performance liquid chromatography (RP-HPLQ steps employing 
hydrophobic RP-HPLC media, e.g., sffica gel having pendant methyl or other aliphatic 
groups, can be employed to further purify a fusion protein composition. 

Some or all of the foregoing purification steps, in various combinations, can be 
employed to provide an essentially homogeneous recombinant protein. Recombinant 
25 cell culture enables the production of the fusion protein free of those contaminating 
proteins which may be normally associated with gpl30 or LIF-R as they are found in 
nature in mefr respective species of origin, e.g, in cells, cell exudates or body fluids. 

The foregoing purification procedures are among those that may be employed to 
purify non-recombinant receptors of the present invention as welL When linking 
30 procedures that may produce homodimers (gpl30-linker-gpl30 and IJF-R-Hnker-LIF- 
R) are employed, purification procedures that separate the desired heterodimer from 
such homodimers are employed. An example of such a procedure is sequential 
immunopurification as discussed above. 

Recombinant protein produced in bacterial culture is usually isolated by initial 
35 extraction from cell pellets, followed by one or more concentration, salting-out, 
aqueous ion exchange or size exclusion chromatography steps. Finally, high 
performance liquid chromatography (HPLC) can be employed for final purification 
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steps. Microbial cells employed in expression of recombinant fusion proteins can 
disrupted by any convenient method, including freeze-thaw cycling, sonication, 
mechanical disruption, or use of cell lysing agents. 

Fermentation of yeast which express fusion proteins as a secreted protein 

5 greatly simplifies purification. Secreted recombinant protein resulting from a large- 
scale fermentation can be purified by methods analogous to those disclosed by Urdal et 
al. (/. Chromatog. 296:171, 1984), involving two sequential, reversed-phase HPLC 
steps for purification of a recombinant protein on a preparative HPLC column. 

The present invention also provides a pharmaceutical composition comprising a 

10 receptor protein of the present invention with a physiologically acceptable carrier or 
diluent Such carriers and diluents will be nontoxic to recipients at the dosages and 
concentrations employed. Such compositions may, for example, comprise the receptor 
protein in a buffered solution, to which may be added antioxidants such as ascorbic 
acid, low molecular weight (Less than about ten residues) polypeptides, proteins, amino 

15 acids, carbohydrates including glucose, sucrose or dextrins, chelating agents such as 
EDTA, glutathione and other stabilizers and excipients. The receptor of the present 
invention may be administered by any suitable method in a manner appropriate to the 
indication, such as intravenous injection, continuous infusion, sustained release from 
implants, etc. 

20 The DNA and/or amino acid sequences of gpl30 and LIF-R may vary from 

those presented in SEQ ID NO: 1 and SEQ ID NO: 5. Due to the known degeneracy of 
the genetic code, there can be considerable variation in nucleotide sequences encoding 
the same amino acid sequence. The DNA sequences capable of hybridizing to the 
native DNA sequence of SEQ ID NO: 1 or SEQ ID NO: 5 under moderately stringent 

25 conditions (50°C, 2 X SSC), and which encode a biologically active gpl30 or LIF-R 
polypeptide, are also considered to be gpl30-encoding or LIF-R-encoding DNA 
sequences, respectively, in the context of the present invention. Further, certain 
mutations in a nucleotide sequence which encodes LIF-R or gpl30 will not be 
expressed in the final protein product. For example, nucleotide substitutions may be 

30 made to enhance expression, primarily to avoid secondary structure loops in the 
transcribed mRNA (see EPA 75,444A, incorporated herein by reference). Other 
alterations of the nucleotide sequence may be made to provide codons that are more 
readily translated by the selected host, e.g., the well-known E. coli preference codons 
for E. coli expression. 

35 The amino acid sequence of native gpl30 or LIF-R may be varied by 

substituting, deleting, adding, or inserting one or more amino acids to produce a gpl30 
or LIF-R variant. Variants that possess the desired biological activity of the native 
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gpl30 and LIF-R proteins may be employed in the receptor of the present invention. 
Assays by which the biological activity of variant proteins may be analyzed are 
described in the examples below. 

Alterations to the native amino acid sequence may be accomplished by any of a 

5 number of known techniques. For example, mutations can be introduced at particular 
loci by synthesizing oligonucleotides containing a mutant sequence, flanked by 
restriction sites enabling ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes an analog having the desired 
amino acid insertion, substitution, or deletion. 

10 Alternatively, ohgonucleotide-oirected site-specific mutagenesis procedures can 

be employed to provide an altered gene having particular codons altered according to 
the substitution, deletion, or insertion required. Exemplary methods of making the 
alterations setfbrth above are disclosed by Walderet al. {Gene 42:133, 1986); Bauer et 
aL (Gene 37:73, 1985); Craig (BioTechniques, January 1985, 12-19); Smith et al. 

15 (Genetic Engineering: Principles and Methods, Plenum Press, 1981); U.S. Patent No. 
4,518,584, and U.S. Patent No. 4,737,462, which are incorporated by reference 
herein. 

Bioequivalent variants of LIF-R and gpl30 may be constructed by, for 
example, making various substitutions of amino acid residues or deleting terminal or 

20 internal amino acids not needed for biological activity. In one embodiment of the 
invention, the variant amino acid sequence is at least 80% identical, preferably at least 
90% identical, to the native sequence. Percent similarity may be determined, for 
example, by comparing sequence information using the GAP computer program, 
version 6.0, available from the University of Wisconsin Genetics Computer Group 

25 (UWGCG). The GAP program utilizes the alignment method of Needleman and 
Wunsch (J. MoL BioL 48:443, 1970), as revised by Smith and Waterman (Adv. Appl. 
Math. 2:482, 1981), Briefly, the GAP program defines similarity as the number of 
aligned symbols (Le., nucleotides or amino acids) which are similar, divided by the 
total number of symbols in the shorter of the two sequences. The preferred default 

30 parameters for the GAP program include: (1) a unary comparison matrix (containing a 
value of 1 for identities and 0 for non-identities) for nucleotides, and the weighted 
comparison matrix of Gribskov and Burgess, NucL Acids Res. 74:6745, 1986, as 
described by Schwartz and Dayhoff, eds., Atlas of Protein Sequence and Structure, 
National Biomedical Research Foundation, pp. 353-358, 1979; (2) apenalty of 3.0 for 

35 each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no penalty 

for end gaps. 



WO 93/10151 



PCT/US92/10272 



21 

Generally, substitutions should be made conservatively; i.e., the most preferred 
substitute amino acids are those having physiochemical characteristics resembling those 
of the residue to be replaced. Examples of conservative substitutions include 
substitution of one aliphatic residue for another, such as Be, Val, Leu, or Ala for one 
5 another, or substitutions of one polar residue for another, such as between Lys and 
Arg; Glu and Asp; or Gin and Asn. Other such conservative substitutions, for 
example, substitutions of entire regions having similar hydrophobicity characteristics, 
are well known. Moreover, particular amino acid differences between human, murine 
and other mammali an LEF-Rs is suggestive of additional conservative substitutions that 

10 may be made without altering the essential biological characteristics of LIF-R. 

Cysteine residues can be deleted or replaced with other amino acids to prevent 
formation of unnecessary or incorrect intramolecular disulfide bridges upon 
renaturation. Adjacent dibasic amino acid residues may be modified to enhance 
expression in yeast systems in which KEX2 protease activity is present. 

15 EP212,914 discloses the use of site-specific mutagenesis to inactivate KEX2 

protease processing sites in a protein. KEX2 protease processing sites are inactivated 
by deleting, adding or substituting residues to alter Arg- Arg, Arg-Lys, and Lys-Arg 
pairs to eliminate the occurrence of these adjacent basic residues. Lys-Lys pairings are 
considerably less susceptible to KEX2 cleavage, and conversion of Arg-Lys or Lys- 

20 Arg to Lys-Lys represents a conservative and preferred approach to inactivating KEX2 
sites. 

Hydrophilic amino acids may be substituted for hydrophobic amino acids in the 
transmembrane region and/or intracellular domain of gpl30 and LIF-R to enhance 
water solubility of the proteins. Addition of amino acids to the native sequence may 
25 result from translation of in-frame codons present in linkers used in constructing 
cloning or expression vectors. The LIF-R encoded by clone pHLIF-R-65 contains 
such linker-encoded amino acids at the C-terminus, as described by Gearing et ah, 
supra. 

The present invention also includes proteins with or without associated native- 
30 pattern glycosylation. Expression of DNAs encoding the fusion proteins in bacteria 
such as E. coli provides non-glycosylated molecules. Functional mutant analogs 
having inactivated N-glycosylation sites can be produced by oligonucleotide synthesis 
and ligation or by site-specific mutagenesis techniques. These analog proteins can be 
produced in a homogeneous, reduced-carbohydrate form in good yield using yeast 
35 expression systems. N-glycosylation sites in eukaryotic proteins are characterized by 
the amino acid triplet Asn-Ai-Z, where Al is any amino acid except Pro, and Z is Ser 
or Thr. In this sequence, asparagine provides a side chain amino group for covalent 
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attachment of carbohydrate. Such a site can be eliminated by substituting another 
amino acid for Asn or for residue Z, deleting Asn or Z, or inserting anon-Z ammo acid 
between Ai and Z, or an amino acid other than Asn between Asn and Ai. Known 
procedures for inactivating N-glycosylation sites in proteins include those described in 
5 U.S. Patent 5,071,972 and EP 276,846. 

Variants of the receptor proteins of the present invention also include various 
structural forms of the primary protein which retain biological activity. Due to the 
presence of ionizable amino and carboxyl groups, for example, a receptor protein may 
be in the form of acidic or basic salts, or may be in neutral form. Individual amino acid 
10 residues may also be modified by oxidation or reduction. 

The primary amino acid structure also may be modified by forming covalent or 
aggregative conjugates with other chemical moieties, such as glycosyl groups, lipids, 
phosphate, acetyl groups and the like. Covalent derivatives are prepared by linking 
particular functional groups to amino acid side chains or at the N- or C- termini. Other 
15 derivatives of the receptor protein within the scope of this invention include covalent or 
aggregative conjugates of the receptor protein with other proteins or polypeptides, such 
as by synthesis in recombinant culture as N- or C- terminal fusions. For example, the 
conjugated polypeptide may be a signal (or leader) polypeptide sequence at the N- 
terminal region of the protein which co-translationally or post-translationaUy directs 
20 transferof me protein from its site of synthesis to its site of function inside or outside 
of the cell membrane or wall (e.g., the yeast a-factor leader). Peptides may also be 
added to facilitate purification or identification of the fusion protein (e.g., poly-His). 
The amino acid sequence of the fusion protein can also be linked to the peptide Asp- 
Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK) (Hopp et al., Bio/Technology d:1204, 
25 1988) The latter sequence is highly antigenic and provides an epitope reversibly bound 
by a specific monoclonal antibody, enabling rapid assay and facile purification of 
expressed recombinant protein. This sequence is also specifically cleaved by bovine 
mucosal enterokinase at the residue immediately following the Asp-Lys pairing. 
Receptor proteins capped with this peptide may also be resistant to intracellular 

30 degradation in E.colL 

The receptors of the present invention are primarily useful as oncostatm M 
binding reagents, and may be administered in vivo to inhibit a biological activity of 
oncostatm M (including signal transduction). The inventive receptors also have use as 
T.TF binding reagents. 

35 Disorders mediated by either oncostatin M or LIF may be treated by 

administering a therapeutically effective amount of the receptor of the present invention 
to a human or mammalian patient afflicted with such a disorder. A disorder is said to 
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be mediated -by oncostatin M or LIF when biologically active oncostatin M or LIF 
causes (directly or indirectly) or exacerbates the disorder. Soluble receptor proteins can 
be used to competitively bind to LIF and oncostatin M, thereby inhibiting binding of 
TJF and oncostatin M to cell surface receptors. 
5 As discussed in example 2, gpl30 has now been found to bind oncostatin M, 

although with lower affinity than the inventive receptors comprising both gpl30 and 
LIF-R. gpl30 may be administered to treat conditions mediated by oncostatin M, 
although a gpl30/LIF-R receptor of the present invention would be preferred for such a 
purpose. 

10 Oncostatin M has been reported to stimulate hematopoiesis, stimulate epithelial 

cell proliferation, increase plasmin activity (thereby inducing fibrinolysis), inhibit 
angiogenesis and supress expression of major histocompatibility complex antigens on 
endothelial cells. See PCT application WO 9109057 and European patent application 
no* 422,186. When these or other biological effects of oncostatin M are undesirable, a 

15 receptor of the present invention may be administered to bind oncostatin M. 

Oncostatin M is believed to stimulate production of the cytokine interleukin-6 
(IL-6), as reported by Brown et al., J. Immunol. 147:2175 (1991). Oncostatin M 
therefore indirectly mediates disorders associated with the presence of IL-6. IL-6 has 
been reported to be involved in the pathogenesis of AIDS -associated Kaposi's sarcoma 

20 (deWit et al., /. Intern. Med. [England] 229:539 [1991]). Binding of oncostatin M by 
a receptor of the present invention thus may be useful in treating Kaposi's sarcoma. 
Alternatively, but less preferably, gpl30 may be administered to treat Kaposi's 
sarcoma. 

Among the disorders mediated by LIF are lipoprotein metabolism defects such 
25 as atherosclerosis and obesity, as well as disorders of bone and calcium metabolism or 
disorders associated with LIF overproduction that affect hepatocytes, neurons, or 
leukocytes. The regulation of embryonic and hematopoietic stem cells by LIF may also 
be manipulated with the receptor. A soluble form of the receptor may also be used to 
treat leukemic cells which respond to UF by proliferating. LIF also may play a role in 
30 inducing cachexia in cancer or AIDS patients. The receptor, or antibodies thereto, may 
also be useful as a diagnostic reagent to detect diseases characterized by the presence of 
abnormal LIF-R. 

Oncostatin M and LIF are different proteins, but share certain structural and 
biological properties. If inhibition of a biological activity shared by oncostatin M and 
35 LIF is desired, the receptor of the present invention offers the benefit of binding both of 
these proteins exhibiting the particular biological activity. A receptor binding only one 
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of the proteins would leave the other protein active and continuing to mediate the 
disorder. 

Receptor proteins or derivatives thereof may also be used as reagents in 
receptor-based immunoassays, reagents in assays for oncostatin M or LIF, or as 
5 binding agents for affinity purification of oncostatin M or LEF. The receptor proteins of 
the present invention may be used as immunogens in conventional procedures for 
production of polyclonal or monoclonal antibodies. Such antibodies may be employed 
on immunoaffinity columns for purification of the receptor, or as components of 
diagnostic or research assays. Derivatives may also be obtained by attacking additional 

10 polypeptide^), e.g., by using a cross-linking agent, such as N-maleimidobenzoyl 
succinimide ester that reacts with cysteine and lysine residues. Receptor proteins may 
also be covalently bound through reactive side groups to various insoluble substrates, 
such as cyanogen bromide-activated, bisoxirane-activated, carbonyldiimidazole- 
activated or tosyl-activated agarose structures, or by adsorbing to polyolefin surfaces 

15 (with or without glutaraldehyde cross-linking). 

The following examples are provided to illustrate certain embodiments of the 
invention, and are not to be construed as limiting the scope of the invention. 



KXAMPLES 

Example 1 

Assay to Detect Binding of LIF 



Recombinant human LIF is expressed in yeast and purified to homogeneity 
25 essentially as described by Hopp, et aU Bio/Technology 5:1204 (1988). The purified 
protein is radiolabeled using a commercially available enzymobead radioiodination 
reagent (BioRad). In this procedure 10 Jig LIF in 50 \d 0.2 M sodium phosphate, pH 
7.2, are combined with 50jil enzymobead reagent, 2 mCi of sodium iodide in 20^1 of 
0.05 M sodium phosphate pH 7.0 and 10 nl of 25% B-D-glucose. After 10 minutes at 
30 25*C, sodium azide (20 |il of 50 mM) and sodium metabisulfite (10 |ilof 5 mg/ml) are 
added and incubation is continued for 5 min. at 25°C. The reaction mixture is 
fractionated by gel filtration on a 2 ml bed volume of Sephadex® G-25 (Sigma) 
equilibrated in Roswell Park Memorial Institute (RPMI) 1640 medium containing 2.5% 
(w/v) bovine serum albumin (BS A), 0.2% (w/v) sodium azide and 20 mM Hepes pH 
35 7.4 (b indin g medium). The final pool of 125 I-LIF is diluted to a working stock 
solution of 3 x 10" 8 M in binding medium and stored for up to one month at 4*C 
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without detectable loss of receptor binding activity. The specific activity is routinely in 
the range of 6-8 x 10 15 cpm/mmole LIF. 

The radiolabeled LIF may be employed in any of a number of conventional 
assay procedures to determine whether a given protein or cell binds LIF. Examples of 
5 such assays are those that detect binding of the radiolabeled LIF to cells expressing an 
LIF-binding protein on the cell surface. The radiolabeled LIF also may be employed in 
assays for the presence of LIF-binding proteins in cell culture medium (e.g. LIF- 
binding proteins secreted by recombinant cells). Proteins in cell extracts (e.g. from 
recombinant cells) also may be assayed for the ability to bind the radiolabeled LIF. 

10 In one assay procedure, cells transformed/transfected with an expression 

system encoding a protein to be tested for ability to bind LIF are plated at a density of 2 
x lO^ cells/well in either 6 well dishes (Falcon) or single well chambered slides (Lab- 
Tek). Both dishes and slides are pretreated with 1 ml human fibronectin (10 ug/ml in 
PBS) for 30 minutes followed by 1 wash with PBS. After 48 to 72 hours, cells are 

15 assayed for expression of LIF binding proteins by binding radioiodinated LIF using the 
following slide autoradiography technique. Transfected cells are washed once with 
binding medium (RPMI media 1640 containing 25 mg/ml bovine serum albumin 
(BSA), 2 mg/ml sodium azide, 20 mM HEPES, pH 7.2, and 50 mg/ml nonfat dry milk 
(NFDM) and incubated for 2 hours at 4°C with 1 ml binding medium + NFDM 

20 containing 1.25 x 10"9 M 125 I-LIR After incubation, cells in the chambered slides are 
washed three times with binding buffer + NFDM, followed by 2 washes with PBS, pH 
7.3, to remove unbound 125 I-LIF. The cells are fixed by incubating for 30 minutes at 
room temperature in 10% gluteraldehyde in PBS, pH 7.3, washed twice in PBS, and 
air dried. The slides are dipped in Kodak NTB-2 photographic emulsion (5x dilution in 

25 water) and exposed in the dark for 12 hours to 7 days at 4*C in a light proof box. The 
slides are then developed for approximately 5 minutes in Kodak D19 developer (40 
g/500 ml water), rinsed in water and fixed in Agfa G433C fixer. The slides are 
individually examined with a microscope at 25-40x magnification and positive cells that 
bind LIF are identified by the presence of autoradiographic silver grains against a light 

30 background. 

Cells in the 6 well plates are washed once with binding buffer + NFDM 
followed by 3 washings with PBS, pH 7.3, to remove unbound 125 I-LDF. The cells 
are then trypsinized to remove them from the plate and bound 125 I-LIF is counted on a 
gamma counter. 

35 The cells in transfectant pool(s) testing positive are subdivided into smaller 

pools and the screening process is repeated (with further subdividing of the pools as 
necessary) until an individual clone expressing LIF-binding protein is isolated. Non- 
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specific binding of 125 I-UF may be measured in the presence of 200-fold or greater 
excess of unlabeled LIF. As a control, the same host cells transfected with a vector 
lacking LIF-R-encoding sequences should be assayed to determine whether 
background endogenous LIF receptors are present on the host cells. 
5 In another assay procedure, cells producing a soluble LIF-binding protein that 

is released from the cells into the culture medium may be identified. Cells are collected 
by centrifugation from a culture broth. The supernatant (culture medium) is 
concentrated 10-fold, and I ul aliquots are spotted onto nitroceUulose filters and 
allowed to air dry. Additional binding sites are blocked by overnight incubation at 4*C 
10 in the above-described binding medium containing 3% non-fat dry milk (BMNFDM). 
Filters are incubated for 2h at 4*C in BMNFDM containing 1 nM 125 I-L1F in the 
presence or absence of 200 nM unlabeled LIF, then washed (3 x 5 min) in PBS. 
Filters are exposed to photographic film for 48 hr at room temperature. 

The results of one LIF binding assay conducted according to the following 
15 procedure are shown in Figure 1. Host cells transfected with vector(s) encoding LIF-R 
or gpl30 as described below were assayed for the ability to bind LIF. The host cells 
were the monkey kidney cell line designated COS-7, described by Glutzman, Cell 
23iY15 (1981). In separate transfections, COS-7 cells were transfected with the 
following combinations of vectors. The different types of transfected cells (and non- 
transfected control cells) are designated A-F as shown below, and the curves 
representing the LIF-binding assay data for each transfected or control cell type are also 
labeled A-F in Figure 1. 



20 



(A) B10G (the gpl30 encoding vector described in example 3) and 
25 pHLIFR-65 (the LIF-R encoding vector described in example 3) 

(B) pHLIFR-65 and control vector CAV (a control vector that does not 
encode LIF-R or gpI30; controls for plasmid dilution so that results can be more 
accurately compared with those of COS-7 cells co-transfected with both a gpl30 

30 encoding vector and an LIF-R encoding vector) 

(C) B10G and pHLIFR-65; transfected cells were preincubated with 
non-radiolabeled oncostatin M before incubation with 125 I-LIF 

35 (D) pBQJFR-65 and CAV; transfected cells were preincubated with non- 

radiolabeled oncostatin M before incubation with 125 I-LIF 
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(E) non-transfected COS-7 cells (control) 

(F) B10G and CAV 

5 The assay was performed by a phthalate oil separation method essentially as 

described by Dower et aL, 7. Immunol 752:751 (1984) and Park et aL, 7. Biol. Chem. 
261:4177 (1986). Briefly, the COS-7 host cells were released from 10 cm tissue 
culture plates two days after transfection by incubation in non-enzymatic cell 
dissociation buffer (Sigma) at 37°C for 30-60 minutes. Cells were then washed with 
10 the above-described binding medium and resuspended in binding medium at SxlO 6 
cells/ml. 50\sl aliquots of the cells were incubated with serial dilutions of 125 I-LIF at 
room temperature for one hour with agitation (in the presence or absence of a 200-fold 
excess of unlabeled LIF) in a total volume of 150^1. The unlabeled LIF allowed for 
calculation of the non-specific background binding of LIF. Duplicate aliquots (60pJ) of 
15 each incubation mixture were then transferred to a polyethylene centrifuge tube 
containing a phthalate oil mixture comprising 1.5 parts dibutylphthalate to 1 part bis(s- 
ethylhexyl)phthalate. 

The cells were separated from unbound 125 I-LEF by centrifugation for five 
minutes at 15,000 x g in an Eppendorf microfuge. The centrifuge tubes were cut to 
20 separate the pellet of cells (containing bound l 2 5l-LDF) from the supernatant containing 
unbound 125 T-T IF. The radioactivity in both parts was then determined on a gamma 
counter. The determinations of both cell-bound and unbound radioactivity from the 
two 60|il aliquots were averaged for subsequent calculations. 

The results are presented in Figure 1 as standard Scatchard transformations of 
25 the biological data. The data are reported as the ratio of molecules of 125 I-LIF bound 
per cell, to free 125 I-LIF molecules (y-axis) versus molecules of 125 I-LIF bound per 
cell (x-axis). The dissociation constants (Kd) are shown in Figure 1, along with the 
number of LIF-binding sites per cell. Since a saturating amount of radiolabeled LIF 
was offered, the number of molecules of radiolabeled LIF bound per cell is considered 
30 equivalent to the number of UF binding sites per cell. 

As shown by curve A of Figure 1, COS-7 cells co-transfected with a gpl30 
encoding vector (B10G) and an LIF-R encoding vector (pHLEFR-65) demonstrated 
high affinity LDF binding (Kd=9x10' 10 M) . When these same co-transfected COS-7 
cells were preincubated with non-radiolabeled oncostatin M before incubation with 
35 125 I-LIF (curve C), binding of LIF was greatly reduced (Kr>=4.2xlO- 9 M). Oncostatin 
M thus competes with LIF for binding sites on these transfected cells. 
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COS-7 cells transfected with a vector encoding the single-polypeptide chain 

LIF-R (pHLIFR-65) and with the control vector CAV bound LIF (curve B; 

K D =2.4xlO- 9 M), but with lower affinity than the cells producing both gpl30 andUF- 

R. The COS-7 cells display endogenous high affinity simian LJF receptors (curve E: 
5 K D about 3X10-UM). Transection with pHLIFR-65 (encoding the single polypeptide 

LIF-R) results in display of additional low affinity LIF receptors (K D =2.4xl0-*M; 

curve B, site 2) as well as the simian LIF receptors Kd=3.3x10-HM; curve B, site 1). 
When the COS-7 cells transfected with pHLIFR-65 and CAV were 

preincubated with non-radiolabeled oncostatin M before incubation with ^I-LIF 
10 (CURVE D), binding of LIF to the LIF-R expressed by pHLIFR-65 was essentially 

unchanged compared to the same transfected cells not preincubated with oncostatin M. 

Oncostatin M thus does not compete with LIF for binding to the single polypeptide 

chain LIF-R However, the binding of LIF to the endogenous simian high affinity 

LIF-R on the COS-7 cells was competed. 
15 The COS-7 cells co-transfected with the gpl30 encoding vector and the CAV 

control vector (curve F) did not bind LIF in any measurable amount above the amount 

of binding to the non-transfected COS-7 cells (curve E). 

Example 2 

20 A ssfl v to Det p^t Binding of Oncostatin M 

Oncostatin M may be purified from cells in which the protein is naturally found, 
or from cells transformed with an expression vector encoding oncostatin M. One 
source of oncostatin M is phorbol ester-treated U937 cells, as described by Zarling et 

25 al. PNAS USA. 83:9739 (1986). Purification of recombinant oncostatin M is 
described by Linsley et aL, /. Biol. Chem. 2(54:4282-4289 (1989), which is hereby 
incorporated by reference in its entirety. 

Preferably, oncostatin M is produced in yeast cells transformed with a suitable 
expression vector. A DNA sequence encoding a signal sequence (e.g., a yeast alpha- 

30 factor leader sequence) may be fused to the N-ternrinus of the oncostatin M encoding 
DNA sequence to promote secretion of the protein from the host cells. The protein 
when initially produced may also comprise an N-terminal identification leader (e.g., a 
"flag" sequence such as Asp-Tyr-Lys-Asp 4 -Lys) as described by Hopp et aL, 
Bio/Technology 6:1204 (1988). The flag sequence is highly antigenic and provides an 

35 epitope reversibly bound by a specific monoclonal antibody, enabling facile purification 
of the expressed recombinant protein. This sequence is also specifically cleaved by 
bovine mucosal enterokinase at the residue immediately following the Asp-Lys pairing. 
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Neither the .signal sequence nor the flag sequence is found on the processed final 
oncostatin M product. 

Oncostatin M may be radiolabeled using any suitable conventional procedure, 
such as the radioiodination procedure employed to radiolabel LIF in Example 1. The 
5 radio-iodination of oncostatin M has also been described by Linsley et aL, supra. 

The resulting radiolabeled oncostatin M may be substituted for radiolabeled LIF 
(using the same concentrations and other reaction parameters) in the assay procedures 
described in Example 1 in order to detect proteins and cells that bind oncostatin M. An 
assay for binding of ^Sj-oncostatin M to cells is also described in Linsley et aL, 
10 supra. 

The results of one oncostatin M binding assay are shown in Figure 2. COS-7 
cells transfected with vectors) encoding gpl30 or LEF-R were assayed for the ability to 
bind oncostatin M. In separate transfections, COS-7 cells were transfected with the 
following combinations of vectors. The different types of transfected cells (and non- 
15 transfected control cells) are designated A-E as shown below, and the corresponding 
curves representing the oncostatin M binding assay data for each cell type are also 
labeled A-E in Figure 2. 

(A) B10G (the gpl30 encoding vector described in example 3) and 
20 pHLIFR-65 (the LIF-R encoding vector described in example 3) 

(B) B10G and pHLJFR-65; transfected cells were preincubated with 
non-radiolabeled LEF before incubation with 125 I-oncostatin M 

25 (C) pHLIFR-65 and CAV (a control vector that does not encode LIF-R 

or gpl30; controls for plasmid dilution so that results can be more accurately compared 
with those of COS-7 cells co- transfected with both a gpl30 encoding vector and an 
LIF-R encoding vector) 

30 (D) non-transfected COS-7 cells (control) 

(E) B10G and CAV 

The assay was performed by the phthalate oil separation method described in 
35 example 1 (but substituting oncostatin M for LIF). The results are presented in Figure 
2 as standard Scatchard transformations of the biological data. The data are reported as 
the ratio of molecules of 125 I-oncostatin M bound per cell, to free 125 I-oncostatin M 
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molecules (y-axis) versus molecules of ^I-oncostatin M bound per cell (x-axis). The 
dissociation constants (K D ) are shown in Hgure 2, along with the number of oncostatin 
M-binding sites per celL Since a saturating amount of radiolabeled oncostatin M was 
offered, the number of molecules of radiolabeled oncostatin M bound per cell is 
5 considered equivalent to the number of oncostatin M binding sites per celL 

As shown by curve A in Hgure 2, COS-7 cells co-transfected with a gpl30 
encoding vector (B 10G) and an UF-R encoding vector (pHLIFR-65) demonstrated the 
ability to bind oncostatin M with high affinity (K D -2.4xl0- I0 M). 

COS-7 cells co-transfected with a vector encoding the single-polypeptide chain 
1 0 IJF-R (pHUFR-65) and with the control vector CAV (curve C) did not bind oncostatin 
M in any significant amount above that bound by the non-transfected COS-7 cells 

(curve D). . 

COS-7 cells co-transfected with pHLIFR-65 and B 10G and premcubated with 
non-radiolabeled LIF before incubation with ^I-oncostatin M (curve B) did not bind 
oncostatin M in any measurable amount above that bound by the non-transfected COS- 
7 cells. OF thus competes with oncostatin M for binding sites on the recombinant 
cells. 

The experimental conditions of this assay (the results of which are shown in 
Rgure 2) were not appropriate for accurate detection of low affinity oncostatin M 
receptors. Thus, a separate experiment (phthalate oil separation method) was 
conducted to compare oncostatin M binding by COS-7 cells transf ec ted with B10G 
alone (no CAV control vector) with oncostatin M binding to non-transfected COS-7 
cells Non-transfected COS-7 cells assayed as a control demonstrated a small number 
of high affinity oncostatin M receptors (Kd=3.6x10-10M). The cells transfected with 
B10G demonstrated additional low affinity binding of oncostatin M (Kd=7.7x10-9M). 
The results of this oncostatin M binding assay are shown in Rgure 3 as Scatchard 
transformations of the biological data. The data are reported as the ratio of molecules of 
ISST-oncostatin M bound per cell, to free ^i^ncostatin M molecules (y-axis) versus 
molecules of 1251-oncostatin M bound per cell (x-axis). The scale in Figure 3 differs 
30 from that of Figures 1 and 2 so that the difference in oncostatin M binding by the 
gpl30-producing cells compared to the control cells can be more readily visualized 

Disorders mediated by oncostatin M thus may be treated by administering 
gpl30 or a fragment thereof. Receptors comprising both gpl30 and LIF-R are 
preferred for use in treating such conditions, however, in view of the higher affinity of 
35 such receptors for oncostatin M compared to the affinity of gpl30 alone for oncostatin 
M. gpl30 also may be employed as an oncostatin M-binding reagent in diagnostic and 
research assays. 



25 
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Example 3 

Preparation of a Recombinant Fusion Protein Design ated LIF-R-Linker-gp 1 30 

5 A recombinant receptor protein of the present invention is prepared by the 

following procedure. The receptor comprises an LIF-R fragment at the N-terminus 
attached to a gpl30 fragment through a polypeptide linker. The polypeptide linker is of 
the formula (Gly4Ser)s. An oligonucleotide encoding a portion of the linker sequence, 
i.e., the sequence Ser(Gly4Ser)6Gly is synthesized by any of the conventional known 

10 procedures for oligonucleotide synthesis. The DNA and encoded amino acids 
sequences of the double-stranded oligonucleotide are as foDows: 

SEQIDNO: 7 
15 Bam HI 

5» GATCC GGT GGA GGT GGT TCT GGT GGA GGT GGT TCA GGT GGT GGA GGA TCA 
3« G CCT CCT CCA CCA AGA CCA CCT CCA CCA AGT CCA CCA CCT CCT AGT 
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 

20 

BspMII Xbal 

GGA GGT GGT GGA TCA GGT GGA GGA GGT TCT GGA GGT GGA GGT TCC GGA T 3' 
25 CCT CCA CCA CCT AGT CCA CCT CCT CCA AGA CCT CCA CCT CCA AGG CCT AGATC 
5' 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 

The remaining portion of the linker is added during vector construction as described 
30 below. This oligonucleotide as well as those discussed below may be synthesized on 
an automated DNA synthesis machine such as those available from Biosearch, Inc., 
San Rafael, California or Applied Biosystems. 

The linker encoding oligonucleotide is cloned into a vector that preferably 
contains multiple restriction endonuclease cleavage sites that may be employed for 
35 inserting the sequences encoding LIF-R and gpl30 on either side of, and in the same 
reading frame as, the sequence encoding the linker. One such vector is designated 
pBLUESCRIPT SK® which is available from Stratagene Cloning Systems, La Jolla, 
California. This plasmid vector is replicable in E, coli and contains a polylinker 
segment that includes 21 unique restriction sites. The plasmid is digested with the 
40 restriction enzymes BamHI and Xbal and the linker-encoding oligonucleotide is ligated 
into the vector using conventional techniques. A recombinant vector containing the 
inserted oligonucleotide sequence is identified by restriction endonuclease analysis and 
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sizing by geL electrophoresis. A DNA sequence encoding LIF-R is inserted into the 
pBLUESCRIPT SK® vector upstream of the linker- encoding oligonucleotide and a 
DNA sequence encoding gpl30 is inserted downstream of the linker sequence. cDNA 
molecules encoding soluble fragments of LIF-R and gpl30 were isolated and amplified 
5 using the well known polymerase chain reaction (PCR) procedure. The following 
oligonucleotides were synthesized for use in the PCR procedures: 

SEQ ID NO: 8 (Oligonucleotide No. 1) 

in Sail 

AU 5 . GftTATGTCGACGATGATGGATATTTACGTATGTTTG 3' 



15 SEQ ID NO: 9 (Oligonucleotide No. 2) 

3 « CATACATACACCACTGTTTCCTTTTAAGACCTCCTCCACCISSSTACG 



20 



30 



SEQ ID NO: 10 (Oligonucleotide No. 3) 



25 5' CGCGTCCG^GAGGTGGATCTGAACTTCTAGATCCATGTGGTTATATC 3 



SEQ H> No. 1 1 (Oligonucleotide No. 4) 

3 « CAAACGAGTTCCTCTTTA&CTTATCCGCCGGCGTACG 



OKgonucleotides 1 and 2 are used in a PCR reaction to isolate a soluble 
fragment of LIF-R The template employed in the reaction is the human LIF-R cDNA 

35 cloned as described by Gearing et al. supra. The DNA and encoded amino acid 
sequences of the cDNA clone are represented in SEQ ID NO: 5. The cloning vector 
which contains this human LIF-R cDNA clone was deposited in E. coli host cells with 
the American Type Culture Collection, RockviUe, Maryland, U.SA. on December 11, 
1990, under the name pHLIFR-65 (ATCC Accession Number 68491). The deposit 

40 was made under the conditions of the Budapest Treaty. The 5' primer is 
oligonucleotide No. 1, which includes a DNA sequence encoding the first 8 ainino 
acids of the signal sequence of LIF-R and also comprises upstream sequences that 
introduce a Sal 1 restriction endonuclease cleavage site. OKgonucleotide No. 1 is 
capable of annealing to the (-) strand that is complementary to nucleotides 179-202 of 

45 SEQ ID NO: 5. The 3* primer is oligonucleotide No. 2, which contains a sequence 



WO 93/10151 



PCT/US92/10272 



33 

complementary to nucleotides 2651-2677 of SEQ ID NO: 5 (i.e., includes anti-sense 
nucleotides encoding the last nine amino acids of the extracellular domain of LJF-R). 
Immediately downstream of the LIF-R encoding sequence, oligonucleotide No. 2 
contains a sequence encoding (Gly)4 Ser, and also introduces a BamHI restriction 

5 endonuclease cleavage site. 

A PCR reaction employing oligonucleotides Nos. 1 and 2 thus isolates and 
amplifies a DNA sequence encoding an LIF-R fragment containing the entire signal 
sequence and the entire extracellular domain but lacking the transmembrane region and 
the extracellular domain. The (Gly)4 Ser sequence attached to the 3' terminus of the 

10 LIF-R fragment is part of the polypeptide linker in the final construct 

Any suitable PCR procedure may be employed. One such procedure is 
described in Sarki et al., Science 239:487 (1988). Another is described in 
Recombinant DNA Methodology, Wu et al., eds., Academic Press Inc., San Diego 
(1989), pp. 189-196. In general, PCR reactions involve combining the 5' and 3' 

15 nucleotides with the template DNA and each of the four deoxynucleoside triphosphates 
in a suitable buffered solution. The solution is heated, (e.g, from 95° to 100*C) to 
denature the double-stranded DNA template and is then cooled before addition of a 
DNA polymerase enzyme. Multiple cycles of the reactions are carried out in order to 
amplify the desired DNA fragment 

20 An example of a suitable PCR procedure is as follows. All temperatures are in 

degrees centigrade. The following PCR reagents are added to a 1.5 ml Eppendorf 
microfuge tube: 10 |il of 10X PCR buffer (500 mM KC1, 100 mM Tris-HCl, pH 8.3 
at 25'C, 25 mM MgCl2, and 1 mg/ml gelatin) (Perkin-Elmer Cetus, Norwalk, CN), 10 
|il of a 2 mM solution containing each dNTP (2 mM dATP, 2 mM dCTP, 2 mM dGTP 

25 and 2 mM dTTP), 2.5 units (0.5 \il of standard 5000 units/ml solution) of Tag DNA 
polymerase (Perkin-Elmer Cetus), 50 ng of template DNA, 5 |il of a 20 |iM solution of 
each of oligonucleotide primers 1 and 2, and 74.5 |0l water to a final volume of 100 pi 
The final mixture is then overlaid with 100 |il parafin oil. PCR is carried out using a 
DNA thermal cycler (Ericomp, San Diego, CA) by initially denaturing the template at 

30 94° for 90 seconds, reannealing at 55° for 75 seconds and extending the cDNA at 72° 
for 150 seconds. PCR is carried out for an additional 20 cycles of amplification using a 
step program (denaturation at 94°, 25 sec; annealing at 55°, 45 sec; extension at 72°, 
150 sec), followed by a 5 minute extension at 72°. 

The sample is removed from the parafin oil and DNA extracted by 

35 phenolchlorofonn extraction and spun column chromatography over G-50 (Boehringer 
Mannheim), A 10 |±1 aliquot of the extracted DNA is separated by electrophoresis on 
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1% SeaKem™ agarose (FMC BioProducts, Rockland, ME) and stained with ethidiutn 
bromide to confirm that the DNA fragment size is consistent with the predicted product. 

The PCR-amplified cDNA products are then digested with Sail and BamHI 
restriction enzymes using standard procedures. The Sall/BamHI restriction fragment is 
5 then separated by gel electrophoresis, e.g., on a L2% Seaplaque™ low gelling 
temperature (LGT) agarose, and the band representing the desired fragment is isolated. 
The fragment is inserted into a vector encoding the desired fusion protein as described 

below. . , ■ r, 7- 

A plasmid vector containing human gpl30 cDNA was deposited in E. coh 

10 strain DH5ahost cells with the American Type Culture Collection, Rockville, 
Maryland under the name B10G/pDC303 (DH5a) on November 14, 1991 and 
assigned ATCC Accession No. 68827. The deposit was made under the conditions of 
the Budapest Treaty. The DNA and encoded amino acid sequences of this cloned 
cDNA are shown in SEQ ID NO: 1. 

15 Oligonucleotides 3 and 4 are employed in the polymerase chain reaction 

procedure to amplify and isolate a DNA fragment encoding Ser(Gly) 4 Ser followed by 
amino acids 1 to 597 of SEQ ID NO: 1 (me entire extraceUular domain of themature 
gpl30 protein). The 5' primer, oligonucleotide No. 3, includes nucleotides 310 to 336 
of SEQ ID NO: I, which encode the first nine amino acids of the mature gpl30 

20 protein. This nucleotide sequence is capable of annealing to the (-) strand that is 
complementary to nucleotides 310 to 336 of SEQ ID NO: 1. Oligonucleotide No. 3 
also encodes a Ser(Gly) 4 Ser sequence directly upstream of (and in the same reading 
fame as) the gpl30 sequence, and further positions aBspMH restriction endonuclease 
cleavage site near the 5« terminus of the Ser(Gly) 4 Ser -encoding sequence. 

25 The 3' primer, oligonucleotide No. 4, includes a sequence complimentary to 

nucleotides 2080 to 2100 of SEQ ID NO: 1, i.e., includes anti-sense nucleotides 
encoding the last seven amino acids of the gpl30 extracellular domain. Oligonucleotide 
No. 4 positions a stop codon immediately after the gpl30 sequence and also inserts a 
Notl restriction site downstream. Following amplification of the gpl30 fragment by 

30 PCR, the PCR reaction products are digested with BspMH and Notl and the desired 

fragment is isolated. 

The above-described LIF-R, Ser(Gly 4 Ser) 6 Gly linker, and gpl30 encoding 
fragments are assembled into a single DNA sequence as follows. The 
Ser(Gly 4 Ser)6Gly linker fragment is excised from the pBLUESCRUT SK® vector by 
35 digestion with BamHI and BspMH. The linker fragment is then ligated to the 3' end of 
the UF-R fragment (cleaved at its 3* terminus after the Gly 4 Ser sequence with 
BamHI). The ligation is conducted under conventional conditions. The 3' end of the 
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linker fragment is ligated to the BspMQ-cleaved 5' end of the gp 130 fragment. The 
resulting DNA fragment encodes a receptor of the present invention comprising (from 
5* to 3') the signal sequence and extracellular domain of LIF-R attached to a (Gly4Ser)s 
polypeptide linker which is attached to the mature coding sequence of the gpl30 
5 extracellular domain. 

This DNA fragment may be inserted into any suitable cloning and/or expression 
vector. For example, the pBLUESCRIPT SK® vector may be digested with Sail and 
NotI and the ligated DNA fragment inserted therein. E. coli cells are then transformed 
with the recombinant vector by conventional procedures. 

10 In an alternative procedure, the pBLUESCRIPT SK® vector containing the 

Ser(Gly4Ser)6Gly linker sequence is digested with Sail and BamHI and the above 
described LIF-R-encoding fragment is inserted therein. The resulting vector is then 
digested with BspMII and NotI and the gpl30-encoding fragment is then inserted to 
form the DNA sequence encoding the receptor of the present invention. The cloned 

15 receptor-encoding DNA fragment may be excised and inserted into any suitable 
expression vector (chosen according to the type of host cell that is desired) using 
conventional procedures. Host cells transformed with the recombinant expression 
vector are cultivated to produce the receptor protein. Mammalian host cells are 
generally preferred for producing the recombinant receptor fusion proteins of the 

20 present invention. 

The receptor-encoding construct may be excised by Sail and NotI digestion and 
inserted into a vector suitable for use in mammalian host cells. One suitable vector is 
designated pDC406. cDNA molecules inserted at the Sail site of this vector are 
transcribed and translated using regulatory elements derived from HTV and adenovirus. 

25 pDC406 contains origins of replication derived from S V40, Epstein-Bair virus and 

pBR322. The pDC406 vector into which interleukin-1 receptor cDNA has been cloned 
has been deposited with the American Type Culture Collection, Rockville, Maryland 
USA under accession number CRL10478. The interleukin-1 receptor cDNA may be 
excised from the vector using conventional techniques and replaced with the receptor 

30 encoding DNA of the present invention prepared above. pDC406 is a derivative of 
HAV-EO described by Dower et al., /. Immunol 142:4314 (1989). pDC406 differs 
from HAV-EO by the deletion of the intron present in the adenovirus 2 tripartite leader 
sequence in HAV-EO. 

Examples of suitable mammalian cells for expressing a receptor fusion protein 

35 include CV-1 cells (ATCC CCL70) and COS-7 cells, (ATCC CRL 1651) both derived 
from monkey kidney. Another monkey kidney cell line CV-1/EBNA (ATCC CRL 
10478) was derived by transfection of the CV-1 cell line with a gene encoding Epstein- 
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Bair virus nuclear antigen-1 (EBNA-1) and with a vector containing CMV regulatory 
sequences. See McMahanetaL, £M»a/. 20:2821 (1991). The EBNA-1 gene allows 
for episomal replication of expression vectors, such as HAV-EO or pDC406, that 
contain the EBV origin of replication. 

5 

Example 4 

re paration pf " E&g^aasm BSSS&SSEssism Protein Designating 
" ppl^n-T-inlcer-LlF-R 

10 This receptor of the present invention differs from that of Example 3 in that the 

UF-Rpolypeptide (which was the 5' polypeptide in the receptor of Example 3) is now 
the 3' polypeptide. The following oligonucleotides were synthesized for use in 
preparing the fusion protein: 

15 SEQIDNO:12 

5* GAXATGTCGACAAGATGTTGACGTTGCAG&CTTGG 3* (oligOnUCleonde no. 5) 

SEQIDNO:13 

3* CAAACGaGTTCCTCTTTAACTTCCTCCTCCACCTAGGTACG 5' (oligOnUCleOUde no. 6) 

20 

SEQIDNO:14 

5« CGCGTCCGGAGGAGGTGGTAGCCAGAAAAAGGGGGCTCCTCATG 3' (oligonucleotide 

no. 7) 
25 SEQIDNO: 15 

3- CATACATACACCACTGTTTCCTTTTAAGAATCGCCGGCGTACG 5' (otigOmicleOUde no. 

8) 

Ohgonucleotides 5 and 6 are employed in a polymerase chain reaction procedure 
30 to isolate a fragment of gpl30. The 5 1 primer (oligonucleotide number 5) includes 
nucleotides 244 to 264 of SEQ ID NO: 1 , (the sequence encoding the first seven amino 
acids of the gpl30 signal sequence). Oligonucleotide number 5 also includes a 
sequence that introduces an upstream Sail site. This nucleotide sequence is capable of 
annealing to the (-) strand that is complementary to nucleotides 244 to 264 of SEQ ID 
35 NO: 1. The 3' primer (oligonucleotide number 6) includes a sequence complementary 
to nucleotides 2080 to 2100 of SEQ ID NO: 1, Le. includes antisense nucleotides 
encoding the last seven amino acids of the gp-130 extracellular domain. 
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Oligonucleotide number 6 also encodes a Gly4Ser sequence immediately 3' to (and in 
phase with) the gpl30 sequence, and also inserts a downstream BamHI site. 

A PCR reaction is conducted as described in Example 3 but employing 
oligonucleotides 5 and 6 on the gp 130 cDNA template. A DNA sequence encoding a 
5 gpl30 fragment that includes the 5' signal sequence and the entire extracellular domain, 
but none of the transmembrane region or the cytoplasmic domain, is isolated by the 
PCR reaction. A Gly4Ser sequence is fused to the 3' terminus of the gp 130 fragment. 
The PCR reaction products are digested with Sail and BamHI and the desired fragment 
is isolated. 

10 An LIF-R fragment is isolated and amplified by a PCR reaction employing 

oligonucleotides 7 and 8. The 5" primer (oligonucleotide number 7) includes 
nucleotides 311 to 331 of SEQ ID NO: 5, which encode the first seven amino acids of 
the mature LIF-R protein. This nucleotide sequence is capable of annealing to the (-) 
strand complementary to nucleotides 311 to 331 of SEQ ID NO: 5. Oligonucleotide 

15 number 7 also encodes a Gly4Ser sequence fused to the 5' end of the LIF-R sequence, 
and inserts an upstream BspMII site. The 3* primer (oligonucleotide number 8) is 
complementary to nucleotides 2651 to 2677 of SEQ ID NO: 5 (which encode the last 
nine amino acids of the LIF-R extracellular domain.) Oligonucleotide number 8 also 
adds a stop codon at the 3' end of the LIF-R sequence, and inserts a Not I site 

20 downstream. The PCR reaction products are digested with BspMII and NotI and the 
desired fragment is isolated. 

A DNA sequence encoding the desired receptor protein is prepared by ligating 
the BamHI site of the gpl30 fragment prepared above to the BamHI site at the 5 1 
terminus of the linker fragment described in Example 3. Likewise the C-terminus of 

25 the linker encoding fragment is ligated at the BspMII site to the complementary site of 
the LEF-R encoding fragment prepared above. The resulting DNA fragment may be 
cloned into an expression vector using procedures described in Example 3. The 
receptor encoded by the isolated DNA fragment comprises (from the N-terminus to the 
C-terminus) the signal sequence and extracellular domain of gpl30 attached to a 

30 (Gly4Ser)s polypeptide linker which is attached to the mature coding sequence of the 
LIF-R extracellular domain. 
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Example 5 

Pw . rn rF„ a -nn Prot o n rwnnrising T JF-R Attached to n>1 3Q Through An Fc 

P»lYpftptT<1e Linker 

5 

A receptor prepared in accordance with the following procedures is depicted in 
Figure 4. The following oligonucleotides were synthesized for use in preparing the 
receptor fusion protein: 
SEQIDNO:l6 

10 3' CAIACATACACCACTGTTTCCTTTTAAGACTCGGGTCTAGATACG 5' (oligonucleotide 

no. 9) 

SEQIDNO:17 

3' CAAACG&GTTCCTCTTIAACTTCTCGGGTCTAGaTACG 5* (oligonucleotide no. 10) 

15 

An LIF-R encoding DNA sequence is isolated and amplified in a PCR reaction 
using oligonucleotides 1 and 9. Oligonucleotide number 1 (the 5' primer) inserts an 
upstream Sail site and has been described in Example 3. The 3' primer is 
• oligonucleotide number 9 which includes a sequence complementary to nucleotides 

20 2651 to 2677 of SEQ ID NO: 5, i.e., includes antisense nucleotides encoding the last 
nine amino acids of the extracellular domain of LIF-R, Oligonucleotide number 9 also 
inserts a downstream BgHTsite. The PCR reaction products are digested with Sail and 
BglH, and the desired LIF-R encoding DNA fragment is isolated by gel electrophoresis 
using conventional procedures. Due to the presence of an internal BglH site in theUF- 

25 R sequence, the BglH digestion should be carried out under conditions that effect partial 
digestion. 

A gpl30 encoding DNA fragment is isolated and amplified by PCR reaction 
using oligonucleotides 5 and 10. The S primer (oligonucleotide number 5) inserts an 
upstream Sail site and has been described above in Example 4. The 3' primer is 

30 oligonucleotide number 10, which includes a sequence complementary to nucleotides 
2080 to 2100 of SEQ ID NO: 1, Le., includes antisense nucleotides encoding the last 
seven amino acids of the gpl30 extracellular domain. Oligonucleotide number 10 also 
inserts a downstream BglH site. The PCR reaction products are digested with Sail and 
Bgin, and the desired gpl30 encoding DNA fragment is isolated by gel electrophoresis 

35 using conventional techniques. 

cDNA encoding a single chain polypeptide derived from the Fc region of a 
human IgGl antibody has been cloned into the above-described pBLUESCRIPT SK® 
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vector to produce a recombinant vector designated hlgGlFc. A unique Bgin site is 
positioned near the 5' end of the inserted Fc encoding sequence. An Spel site is 
immediately downstream of the stop codon. The DNA and encoded amino acid 
sequences of the cloned FccDNA are presented in SEQ ED NO: 3 and SEQ ID NO: 4. 
5 The Fc polypeptide encoded by the cDNA extends from the N-terminal hinge 

region to the native C-terminus, i.e., is an essentially full-length antibody Fc region. 
Fc fragments, e.g., those that are truncated at the C-terminal end, also may be 
employed. The fragments should contain multiple cysteine residues (at least the 
cysteine residues in the hinge reaction). The antibody from which the Fc polypeptide is 

10 derived is preferably of the same species as the patient to be treated with the fusion 
protein prepared therefrom.. 

Plasmid hlgGlFc is digested with Bglll and Sail and the Bglll/Sall LIF-R 
fragment prepared above is ligated into the vector by conventional techniques. The Fc 
encoding sequence is positioned downstream of, and in the same reading frame as, the 

15 LIF-R sequence. In a separate reaction, the above-described Sall/Bgin fragment of 
gpl30 is also inserted into the same vector. Plasmid vectors containing the desired 
DNA insert are identified by restriction endonuclease digestion analysis, using 
convention techniques. 

The cloned DNA segment encoding the LIF-R-Fc fusion polypeptide may be 

20 excised from the pBLUESCRIPT SK® vector by digestion with Sail and NotL 
Likewise; the DNA segment encoding the gpl30-Fc fusion polypeptide may be excised 
by Sall/NotI digestion. Each excised DNA segment is inserted into an appropriate 
expression vector, depending on the type of host cell that is desired. One suitable 
expression vector is the plasmid pDC406, which may be transformed into mammalian 

25 host cells as described in Example 3. 

In one embodiment of the invention, an expression vector encoding the LIF-R- 
Fc fusion and a second expression vector encoding the gpl30-Fc fusion are co- 
transfected into the desired host cells. Two separate recombinant polypeptides are thus 
produced in the host cells. The first polypeptide comprises the Fc polypeptide fused in 

30 frame to the C-terminus of the gpl30 fragment. The second polypeptide comprises the 
Fc polypeptide fused in frame to the C-terminus of the LJF-R fragment Disulfide 
bonds that form between the two Fc regions covalently link the two separate fusion 
polypeptides into a receptor protein of the present invention. 

Alternatively, the LIF-R-Fc and gpl30-Fc polypeptides may be separately 

35 transformed into host cells (as opposed to co-transfection into the same host cell.) The 
two polypeptides are purified from the host cells and then combined in a suitable 
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buffered solution, whereupon interchain disulfide bonds' form between the two Fc 

regions. \ . 

" The receptor protein may be purified using any of a number of conventional 

protein purification techniques. Since antibody Fc regions bind to protein A and 
5 protein G, affinity chromatography employing protein A or protein G attached to an 
insoluble support material may be employed in the purification process. In one 
procedure, onelher of culture supernatant containing the receptor is passed over a solid 
phase protein G column, and the column is men washed thoroughly with phosphate- 
buffered saline (PBS). The adsorbed Fc-containing fusion protein is eluted with 50 
10 mM glycine buffer, pH 3 and brought to pH 7 with 2 MTris buffer, pH 9. Further 
purification may involve irnmunoaffinity column(s), eg., affinity columns having LIF 
or OSM bound thereto. 

Example 6 

15 P^ pn-atirm nfM™™ln™l Antibodies Directed affrinst a Receptor 

Preparations of a purified receptor protein of the present invention, or 
transfected COS cells expressing high levels of the receptor, are employed to generate 
monoclonal antibodies against the receptor using conventional techniques, for example, 
20 those disclosed in U.S. Patent 4,411,993. To immunize mice, a receptor immunogen 
is emulsified in complete Freunds adjuvant and injected subcutaneously in amounts 
ranging from 10-lOOU-g into Balb/c mice. Ten to twelve days later, the irnmunized 
animals are boosted with additional immunogen emulsified in incomplete Freunds 
adjuvant and periodically boosted thereafter on a weekly to biweekly immunization 

25 schedule. Serum samples are periodically taken by retro-orbital bleeding or tail-tip 
excision for testing by dot-blot assay (antibody sandwich) or ELISA (enzyme-linked 
immunosorbent assay). Other assay procedures are also suitable. Following detection 
of an appropriate antibody titer, positive ammals are given an intravenous injection of 
antigen in saline. Three to four days later, the animals are sacrificed, splenocytes 

30 harvested, and fused to the murine myeloma cell line NS1. Hybridoma cell lines 

generated by this procedure are plated in multiple microtiter plates in a HAT selective 
medium (hypoxanrine, arnmopterin, and mymidine) to inhibit proliferation of non-fused 
cells, myeloma hybrids, and spleen cell hybrids. 

Hybridoma clones thus generated can be screened by ELISA for reactivity with 

35 the receptor protein, for example, by adaptations of the techniques disclosed by Engvall 
etzLJmmunochemSXll (1971) and in U.S. Patent 4,704,004. Positive clones are 
then injected into the peritoneal cavities of syngeneic Balb/c mice to produce ascites 
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containing high concentrations (greater than 1 mg/ml) of anti-receptor monoclonal 
antibody. The resulting monoclonal antibody can be purified by ammonium sulfate 
precipitation followed by gel exclusion chromatography, and/or affinity 
chromatography based on binding of antibody to Protein A of Staphylococcus aureus. 

5 

Example 7 

Peterodimeric Receptor 

An expression vector encoding a fragment of the human LEF-R extracellular 

10 domain fused to a polypeptide derived from the Fc region of an antibody was 

constructed as follows. A second expression vector encoding a fragment of the human 
gpl30 extracellular domain fused to an Fc polypeptide also was constructed. 

Plasmid pHLIF-R-65 (ATCC 68491), which contains human LIF-R cDNA in 
expression vector pDC303 as described in example 3, was digested with the restriction 

15 enzymes Asp718 and XmnI. Asp718 cleaves the vector upstream of the LIF-R cDNA 
insert. XmnI is a blunt cutter that cleaves within the codon for amino acid number 702 
(Asp) of SEQ ID NO:5, upstream of the transmembrane region. The desired 
Asp718PCmnI fragment (about 2,444 bp in length) was separated by electrophoresis on 
an agarose gel and purified by conventional procedures using an Elutip column. 

20 A recombinant vector designated hlgGlFc, comprising cDNA encoding a single 

chain polypeptide derived from the Fc region of a human IgGl antibody in a 
pBLUESCRIPT SK® vector was described in example 5. The DNA and encoded 
amino acid sequences of the cloned Fc cDNA are presented in SEQ ID NO:3 and SEQ 
ID NO:4. A polylinker region comprising a number of restriction sites is positioned 

25 immediately upstream of the Fc cDNA. 

Plasmid hlgGlFc was digested with Asp718 and StuI, which cleave within the 
pollylinker upstream of the Fc sequence. The Asp718PCmnI LIF-R fragment prepared 
above was ligated into the cleaved hlgGlFc vector by conventional techniques. StuI 
and XmnI both produce blunt ends, which will ligate together. In the resulting 

30 recombinant vector, the Fc encoding sequence is positioned downstream of, and in the 
same reading frame as, the LIF-R sequence. The encoded LIF-R/Fc fusion protein 
comprises amino acids -44 to 702 of SEQ ID NO:5 (LIF-R), followed by six amino 
acids constituting a peptide linker encoded by the polylinker segment of plasmid 
hlgGlFc, followed by amino acids 1-232 of SEQ ID NO:3 (Fc). E. coli cells were 

35 transformed with the ligation mixture and plasmids were isolated therefrom by standard 
procedures. Plasmid vectors containing the desired DNA insert were identified by 
restriction endonuclease digestion analysis. 
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The clonedDNA segment encoding the LIF-R/Fc fusion polypeptide was 
excised from the recombinant vector by digestion with Asp718 and Nod . The Nod 
enzyme cleaves the vector in a poly linker region just downstream of the Fc cDNA 
insert. The excised DNA segment (3.2 kb) is inserted into an appropriate expression 
5 vector, depending on the type of host cell that is desired. One suitable expression 
vector is pCAV/NOT, a mammalian expression vector described in PCT application 
WO 90/05183. 

pCAV/NOT was cleaved with Asp7I8 and Nod, both of which cleave in the 
multiple cloning site. The LIF-R/Fc-encoding Asp718/Nod DNA fragment prepared 

10 above was Iigated into the vector. 

An expression vector encoding a soluble gpl30/Fc fusion protein was 
constructed as follows. Recombinant vector B10G/pDC303 (ATCC 68827) 
comprising human gpl30 cDNA (described in example 3) was digested with EcoRl, 
and the resulting S overhang was rendered blunt using T4 DNA polymerase. The 

15 recognition site for EcoRl comprises nucleotides 2056-2061 of SEQ ID NO:l. The 
EcoRl-digested vector was then cleaved with Xhol, which cleaves in the vector 
upstream of the gpl30 cDNA insert . 

Plasmid hlgGlFc, comprising Fc polypeptide-encoding cDNA as described 
above, was digested with StuI (a blunt cutter) and Notl, which cleave upstream and 

20 downstream, respectively, of the inserted Fc cDNA. The (EcoRl)/XhoI gpl30 
fragment isolated above was Iigated to the Fc-containing fragment and to Xhol/Notl- 
digested vector SF CAV/NOT. The mammalian expression vector SF CAV/NOT is 
essentially identical to SF CAV (ATCC 68922) but contains a Notl site. SF CAV/NOT 
also is essentially identical to pCAV/NOT, described in PCT application WO 90/05183, 

25 except that a segment of the adenovirus-2 tripartite leader (TPL) containing a cryptic 
promoter functional in bacteria has been deleted. Protein expression from the cryptic 
promoter is potentially disadvantageous for preparing and isolating a desired 
recombinant plasmid in bacterial cells. 

E. coli cells were transformed with the ligation mixture, plasmids were isolated 

30 therefrom by conventional procedures, and the desired recombinant plasmids were 
identified by restriction analysis. The gpl30/Fc fusion protein encoded by the desired 
recombinant vector comprises (from N- to C-terminus) amino acids -22 to 582 of SEQ 
ID NO:2 (gpl30), followed by 7 amino acids constituting a peptide linker encoded by 
the polylinker segment of plasmid hlgGlFc, followed by amino acids 1-232 of SEQ ID 

35 NO:4 (Fc). 

COS-7 cells (ATCC CRL 1651) were transfected with either the UF-R/Fc- 
encoding recombinant expression vector or the gpI30/Fc-encoding expression vector 
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prepared above, or with both expression vectors. The cells were cultivated to allow 
expression of the soluble fusion proteins. The expressed proteins were recovered by 
incubating culture supernatant with Protein G Sepharose beads (available from 
Pharmacia) overnight at 4°C, then pelleting the beads by centrifugation. The binding of 

5 125 I-labeled human oncostatin M and 125 I-labeled human LIF by the proteins bound to 
the beads was analyzed. 

Binding affinity was determined by performing a variation of a standard 
Scatchard analysis. The binding assay procedure was similar to that described by 
Mosley et al. (Cell 59:335, 1989) except that the proteins, being soluble, are attached to 

10 Protein G Sepharose beads rather than being on the surface of the transfected cells. 
Briefly, in a 96-well microliter plate, each of ten 1:2 serial dilutions of 125 I-LIF or 125 I- 
oncostatin M was incubated with a sample comprising the expressed proteins (bound to 
the beads) resuspended in RPMI 1640 containing 2.5% bovine serum albumin, 0.2% 
(v/v) sodium azide and 20mM Hepes, pH 7.4, for 2 hours at 4°C with agitation. 

15 Duplicate standard cold competition wells also were incubated. Centrifuge tubes 
containing bovine calf serum were used in place of the phthalate oil mixture-containing 
tubes in the separation method described by Dower et al., J. Immunol. 132:751 (1984) 
and Park et al., /. Biol Chem. 261:4177 (1986) and in example 1 above. Aliquots of 
each incubation mixture were transferred to the tubes. After centrifugation, tubes were 

20 cut, die radioactivity counted, and processed as for standard Scatchard analysis. 

Figure 7 presents Scatchard analyses of the binding of 125 I oncostatin M by 
gpl30/Fc homodimers produced by the cells transfected with the gpl30/Fc vector alone 
(upper left) and by the proteins expressed by the co-transfected cells (lower left). 
Scatchard analyses of the binding of 125 I LIF by LIF-R/Fc homodimers produced by 

25 the cells transfected with the LIF-R/Fc vector alone (upper right) and by the proteins 
expressed by the co-transfected cells (lower right) are also presented in figure 7. A 
shift toward higher affinity binding of oncostatin M by the proteins recovered from the 
co-transfected cells, compared to the gpl30/Fc homodimer, is evident from figure 7. 
Likewise, the data in figure 7 indicate a shift toward higher affinity binding of LIF by 

30 the proteins recovered from the co-transfected cells, compared to the LIF-R/Fc 
homodimer. The shift toward higher affinity binding indicates the presence of 
heterodimers comprising LIF-R/Fc and gpl30/Fc, and further indicates that the LIF-R 
and gpI30 moieties are cooperating, i.e., interacting, in the binding of oncostatin M 
and LIF. Controls demonstrated no oncostatin M binding by LIF-R homodimers, and 

35 no LIF binding by gpl 30 homodimers. 
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Example 8 

EgesBSaS rhm prirfng UF - R mwf ftt! 30 P olypeptides Lacking FNUipomai^ 

DNA sequences encoding soluble LEF-R and gp 130 proteins lacking fibronectin 
5 type HI (FNIH) domains were isolated and fused to an Fc-encoding sequence. Deleting 
the FNm domains affords the advantage of reducing the size of the LJF-R/Fc and 
gpl30/Fc fusion proteins. The LIF-R protein of SEQ ID NO:6 comprises three repeats 
of a fibronectin type IE-like module in the extracellular domain. The three domains 
containing FNIH modules comprise amino acids 487 (Thr) to 584 (Asn), 585 (Asp) to 
10 679 (Ala), and 680 (Pro) to 789 (Ser), respectively, of SEQ ID NO:6. gpl30 also 
contains three FNm domains, comprising amino acids 300 <Tyr) to 399 (Phe), 400 
(Gin) to 496 (Pro), and 497 (Pro) to 597 (Glu), respectively, of SEQ ID NO:2. From 
one to all three of the FNm domains may be removed from gpl30 or LIF-R to reduce 
the size of the protein . 

15 The FNm domains of human LIF-R were removed by digesting the LEF-R/Fc- 

encoding expression vector prepared in example 7 with the restriction endonuclease Eco 
O 1091 (isoschizomer of Dra H) and filling in the resulting overhangs using T4 DNA 
polymerase according to conventional procedures. The recognition site for Eco O 1091 
spans nucleotides 1789-1795 of SEQ ID NO:5 (LIF-R), cleaving within the codons for 

20 amino acids 8-9 of the first FNm domain of LIF-R. The cleaved vector was then 
digested with BstXl and EcoR5. The recognition site for BstXl spans nucleotides 
1048-1059 of SEQ ID NO:5 and EcoR5 (which generates blunt ends) cleaves within 
thepolylinker upstream of the Fc sequence. The BstXl/EcoR5 fragment (comprising 
the 5" end of LIF-R, the vector, the entire Fc sequence, and a portion of the polylinker) 

25 and the BstXl/(Eco O 109D LIF-R fragment were isolated and ligated together. E. coli 
cells were transformed with the ligation mixture, plasmids were isolated therefrom, and 
the desired recombinant plasmid was identified by restriction analysis. The resulting 
construct encodes a fusion protein comprising (from N- to C-terminus) amino acids -44 
to 494 of SEQ ID NO:5 (LIF-R), a four amino acid spacer peptide -His-Arg-Tyr-Val- 

30 encoded by thepolylinker segment, and amino acids 1-232 of SEQ ID NO:3 (Fc). The 
LIF-R polypeptide moiety contains the first 8 amino acids of the first FNm domain, 
but lacks the remainder of the first FNm domain and all of the second and third FNm 
domains. 

The FNm domains of gpI30 were removed by digesting the recombinant 
35 gpl30/Fc-encoding expression vector prepared in example 7 with BstXl, then blunting 
the overhang using T4 DNA polymerase according to conventional procedures. The 
recognition site for BstXl spans nucleotides 1231-1242 of SEQ ID NO:l (gpl30), 
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cleaving within the codons for amino acids 10-11 of the first FNIII domain of gpl30. 
The cleaved vector was then digested with EcoR5, which cleaves within the polylinker 
upstream of the Fc sequence and generates blunt ends. The (BstXl)/EcoR5 fragment 
comprising the 5' end of gpl30 Qacking the FNIII domains), the vector sequences, the 
5 Fc sequence, and a portion of the polylinker, was ligated. E. coli cells were 
transformed with the ligation mixture, plasmids were isolated therefrom, and the 
desired recombinant plasmid was identified by restriction analysis. The fusion protein 
encoded by the construct comprises (from N- to C-terminus) amino acids -22 to 308 of 
SEQ ID NO:2 (gpl30), a four amino acid spacer peptide -Asn-Arg-Tyr-Val- encoded 

10 by the polylinker segment, and amino acids 1-232 of SEQ ID NO:3 (Fc). The gpl30 
polypeptide moiety contains the first 9 amino acids of the first FNIII domain, but lacks 
the remainder of the first FNIH domain and all of the second and third FNIII domains. 

FNTH domains may be deleted from the gpl30 component of a receptor of the 
present invention, from the UDF-R component, or from both. In one embodiment of the 

15 invention, COS -7 cells were co-transfected with the soluble LIF-R/Fc-encoding 
mammalian expression vector prepared in example 7 and the mammalian expression 
vector encoding a soluble gpl30/Fc protein lacking the FNm domains prepared above. 
Analysis of the expressed proteins by SDS-PAGE revealed a band of the molecular 
weight expected for the heterodimer, along with bands that include those of the 

20 molecular weight expected for the two homodimers. Scatchard analyses conducted 
according to the procedures described in example 7 demonstrated a shift toward higher 
affinity binding of LIF and oncostatin M for proteins expressed by the co-transfected 
cells compared to the corresponding homodimers. This result indicates the presence of 
heterodimers comprising LEF-R/Fc and gpl30/Fc, and further indicates that the LIF-R 

25 and gp!30 moieties are cooperating, i.e., interacting, in binding oncostatin M and LIF. 

pRjEF DESCRIPTION OF TffE SEQUENCE MSTTNG 

SEQ ID NO:l and SEQ ID NO:2 present the DNA sequence and encoded amino acid 
30 sequence for cloned cDNA encoding an N-terminal fragment of gp 1 30. 

SEQ ID NO:3 and SEQ ID NO:4 present the DNA sequence and encoded amino acid 
sequence for cloned cDNA encoding a polypeptide that corresponds to the Fc region of 
an IgGl antibody. 

35 

SEQ ID NO:5 and SEQ ID NO:6 present the DNA sequence and encoded amino acid 
sequence for cloned cDNA encoding an N-tenninal fragment of LIF-R. 
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SEOmNOrTpresentstheDNAsequenceofthccodingstrandofachemic^y . 
synthesized DNA molecule encoding a polypeptide linker used in constructing certain 



SEQIDN08 - SEQ1D NO:17 present theDNA sequence of various single-stranded 
oligonucleotide primers employed in polymerase chain reactions to construct certam 
receptors of the present invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Gearing, David P. 

(ii) TITLE OF INVENTION: Receptor for Oncostatin M and Leukemia 
Inhibitory Factor 

(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Immunex Corporation 

(B) STREET: 51 University Street 
<C> CITY: Seattle 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 98101 

<v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Seese, Kathryn A. 

(B) REGISTRATION NUMBER: 32,172 

(C) REFERENCE / DOCKET NUMBER: 2607 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 206-587-0430 
<B) TELEFAX: 206-587-0606 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: human placenta 



(vii) IMMEDIATE SOURCE: 
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(B) CLONE: B10G/pDC303 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 244.. 2369 

(ix> FEATURE: 

(A) NAME/KEY: mat _peptxae 

(B) LOCATION: 310.. 2369 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 244.. 309 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GGCCCGCGGA GTCGCGCTGG GCCGCCCCGG CGCAGCTGAA CCGGGGGCCG CGCCTGCCAG 
GCCGACGGGT CTGGCCCAGC CTGGCGCCAA GGGGTTCGTG CGCTGTGGAG ACGCGGAGGG 
TCGAGGCGGC GCGGCCTGAG TGAAACCCAA TGGAAAAAGC ATGACATTTA GAAGTAGAAG 
ACTTAGCTTC AAATCCCTAC TCCTTCACTT ACTAATTTTG TGATTTGGAA ATATCCGCGC 
~~ mm/- irT TGG CTA GTG CAA GCC TTG TTT ATT TTC 

— S £ SS £ 5 St ?.! «. "a l» »e XL P„e 
-22 -20 _15 

S5S = S?SSS = = = JS = SSS 

10 1S 

_ ___ rRA TGX ATG gat tat ttt cat gta aat gct art 

TGT GTG CTA AAG GAA AAA TGT Ai^ W|x Ala Asn 

Cys Val Leu Lys Glu Lys Cys Met Asp Tyr rue m ^ 

s s s 5 z s = s s 2 s ^ s s a £ 



45 



_ ,„ ___ nvrr AGT GTC ACC TTT ACA GAT ATA GCT 

£ S S JS E S S S £ vS Thr Thr *,p XXe ^ 
60 65 

— r-»r- rTr ACT TGC AAC ATT CTT ACA TTC GGA CAG CTT 

2SSS22£S» »• — *s fte sly sln " u 

75 80 
r „ rAG «AT GTT TAT GGA ATC ACA ATA ATT TCA GGC TTG CCT CCA GAA 
Su" Sn Val Tyr Gly He Thr Xle Xle Ser Gly Leu Pro Pro Glu 

90 95 
AAA CCT AAA AAT TTG AGT TGC ATT GTG AAC GAG GGG AAG AAA ATG AGG 
SI So J£ Sn Leu Ser Cys He Val Asa Glu Gly Lys Lys Met Arg 
110 115 



60 
120 
180 
240 
288 

336 

384 

432 

480 

528 

576 

624 

672 
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TGT GAG TGG GAT GGT GGA AGG GAA ACA CAC TTG GAG ACA AAC TTC ACT 

Cys Glu Trp Asp Gly Gly Arg Glu Thr His Leu Glu Thr Asn Phe Thr 
125 130 135 

TTA AAA TCT GAA TGG GCA ACA CAC AAG TTT GCT GAT TGC AAA GCA AAA 
Leu Lys Ser Glu Trp Ala Thr His Lys Phe Ala Asp Cys Lys Ala Lys 
140 145 150 

CGT GAC ACC CCC ACC TCA TGC ACT GTT GAT TAT TCT ACT GTG TAT TTT 
Arg Asp Thr Pro Thr Ser Cys Thr Val Asp Tyr Ser Thr Val Tyr Phe 
155 160 165 

GTC AAC ATT GAA GTC TGG GTA GAA GCA GAG AAT GCC CTT GGG AAG GTT 
Val Asn lie Glu Val Trp Val Glu Ala Glu Asn Ala Leu Gly Lys Val 
170 175 180 185 

ACA TCA GAT CAT ATC AAT TTT GAT CCT GTA TAT AAA GTG AAG CCC AAT 
Thr Ser Asp His lie Asn Phe Asp Pro Val Tyr Lys Val Lys Pro Asn 
190 195 200 

CCG CCA CAT AAT TTA TCA GTG ATC AAC TCA GAG GAA CTG TCT AGT ATC 
Pro Pro His Asn Leu Ser Val lie Asn Ser Glu Glu Leu Ser Ser lie 
205 210 215 

TTA AAA TTG ACA TGG ACC AAC CCA AGT ATT AAG AGT GTT ATA ATA CTA 
Leu Lys Leu Thr Trp Thr Asn Pro Ser lie Lys Ser Val He He Leu 
220 " 225 230 

AAA TAT AAC ATT CAA TAT AGG ACC AAA GAT GCC TCA ACT TGG AGC CAG 
Lys Tyr Asn lie Gin Tyr Arg Thr Lys Asp Ala Ser Thr Trp Ser Gin 
235 240 245 

ATT CCT CCT GAA GAC ACA GCA TCC ACC CGA TCT TCA TTC ACT GTC CAA 
He Pro Pro Glu Asp Thr Ala Ser Thr Arg Ser Ser Phe Thr Val Gin 
250 255 260 265 

GAC CTT AAA CCT TTT ACA GAA TAT GTG TTT AGG ATT CGC TGT ATG AAG 
Asp Leu Lys Pro Phe Thr Glu Tyr Val Phe Arg He Arg Cys Met Lys 
270 275 280 

GAA GAT GGT AAG GGA TAC TGG AGT GAC TGG AGT GAA GAA GCA AGT GGG 
Glu Asp Gly Lys Gly Tyr Trp Ser Asp Trp Ser Glu Glu Ala Ser Gly 
285 ** 290 295 

ATC ACC TAT GAA GAT AGA CCA TCT AAA GCA CCA AGT TTC TGG TAT AAA 
He Thr Tyr Glu Asp Arg Pro Ser Lys Ala Pro Ser Phe Trp Tyr Lys 
300 305 310 

ATA GAT CCA TCC CAT ACT CAA GGC TAC AGA ACT GTA CAA CTC GTG TGG 
He Asp Pro Ser His Thr Gin Gly Tyr Arg Thr Val Gin Leu Val Trp 
315 320 325 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



AAG ACA TTG CCT CCT TTT GAA GCC AAT GGA AAA ATC TTG GAT TAT GAA 
Lys Thr Leu Pro Pro Phe Glu Ala Asn Gly Lys He Leu Asp Tyr Glu 
330 335 340 345 



1344 



GTG ACT CTC ACA AGA TGG AAA TCA CAT TTA CAA AAT TAC ACA GTT AAT 
Val Thr Leu Thr Arg Trp Lys Ser His Leu Gin Asn Tyr Thr Val Asn 
350 355 360 
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„. -m* r-rr ACA AAT GAT CGC TAT CTA GCA ACC 

365 



22S52SSSSSSSS5S2 

380 385 

_ mm „ „- j. C m CAC CCT GTA ATG GAT CTT AAA 

K S2E" EE 2SESSS — 3J — «-"-«•■ 

395 400 

r «m tcg GTG GAA TGG ACT ACT CCA AGG 

GCA TTC CCC AAA GAT AAC ATG CTT TGG GTG ^ Thr Pro Ar g 

Ala Phe Pro Lys Asp Asn Met Leu Txp Val wu *s 
410 415 

445 450 



460 4 " 

SS5SSSSSSSSS2SSS 

47S 480 

». r-aa cct CCA CCT TCC AAA GGA CCT ACT GTT CGG ACA 
GCA TAC CTT AAA CAA GCT CCA CCT TW. Thr Val Arg Thr 

Ala Tyr Leu Lys Gin Ala Pro Pro ser iys 5Q5 
490 495 

™» isct riT TTX GAG TGG GAC CAA CTT CCT 

jss:;ss£:~~e s s. - - s « 

* 510 515 

C S S Sj S S E E S 5= £ S - E 5 25 



525 



SSS5SS2SSSSSE5SS 

540 545 

2SSS2SSSSSS2SSSS 

S55 560 

570 575 

„ „. _. r rnmm GCT CAA GGA GAA ATT GAA GCC ATA GTC GTG 
TTT ACT ACC CCA AAG TTT GCT CAA tojA b*» Ala Ile Val Val 

Phe Thr Thr Pro Lys Phe Ala Gin Gly GXu ixe 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 
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CCT GTT TGC TTA GCA TTC CTA TTG ACA ACT CTT CTG GGA GTG CTG TTC 2160 

Pro Val Cys lieu Ala Phe Leu Leu Thr Thr Leu Leu Gly Val Leu Phe 
605 610 " 615 

TGC TTT AAT AAG CGA GAC CTA ATT AAA AAA CAC ATC TGG CCT AAT GTT 2208 
Cys Phe Asn Lys Arg Asp Leu lie Lys Lys His He Trp Pro Asn Val 
620 625 630 

CCA GAT CCT TCA AAG AGT CAT ATT GCC CAG TGG TCA CCT CAC ACT CCT 2256 
Pro Asp Pro Ser Lys Ser His He Ala Gin Trp Ser Pro His Thr Pro 
635 640 645 

CCA AGG CAC AAT TTT AAT TCA AAA GAT CAA ATG TAT TCA GAT GGC AAT 2304 
Pro Arg His Asn Phe Asn Ser Lys Asp Gin Met Tyr Ser Asp Gly Asn 
650 " 655 660 665 

TTC ACT GAT GTA AGT GTT GTG GAA ATA GAA GCA AAT GAC AAA AAG CCT 2352 
Phe Thr Asp Val Ser Val Val Glu He Glu Ala Asn Asp Lys Lys Pro 
670 675 680 

TTT CCA GAA GAT CTG AA 2369 

Phe Pro Glu Asp Leu 
685 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Thr Leu Gin Thr Trp Leu Val Gin Ala Leu Phe lie Phe Leu 
-22 -20 -15 -10 

Thr Thr Glu Ser Thr Gly Glu Leu Leu Asp Pro Cys Gly Tyr lie Ser 
-5 15 10 

Pro Glu Ser Pro Val Val Gin Leu His Ser Asn Phe Thr Ala Val Cys 
15 20 25 

Val Leu Lys Glu Lys Cys Met Asp Tyr Phe His Val Asn Ala Asn Tyr 
30 35 40 

He Val Trp Lys Thr Asn His Phe Thr He Pro Lys Glu Gin Tyr Thr 
45 50 55 

He He Asn Arg Thr Ala Ser Ser Val Thr Phe Thr Asp He Ala Ser 
60 65 70 

Leu Asn He Gin Leu Thr Cys Asn He Leu Thr Phe Gly Gin Leu Glu 
75 80 85 90 

Gin Asn Val Tyr Gly He Thr He He Ser Gly Leu Pro Pro Glu Lys 
95 100 105 
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Pro lys Asn Leu Ser Cys He Val Asn Glu Gly Lys Lys Met Arg Cys 

" 110 115 
Glu Trp Asp Gly Gly Arg Glu Thr His Leu Glu Thr Asn Phe Thr Leu " 

125 130 
Lys Ser Glu Trp Ala Thr His Lys Phe Ala Asp Cys Lys Ala Lys Arg 

140 145 
Asp Thr Pro Thr Ser Cys Thr Val Asp Tyr Ser Thr Val Tyr Phe Val 
155 160 

Asn He Glu Val Trp Val Glu Ala Glu Asn Ala Leu Gly Lys Val Thr 
175 180 

Ser Asp His He Asn Phe Asp Pro Val Tyr Lys Val Lys Pro Asn Pro 
19 q 195 

Pro His Asn Leu Ser Val He Asn Ser Glu Glu Leu Ser Ser He Leu 

205 210 215 

lys Leu Thr Trp Thr Asn Pro Ser He Lys Ser Val He He Leu Lys 

220 225 
Tyr Asn He Gin Tyr Arg Thr Lys Asp Ala Ser Thr Trp Ser Gin He 
235 



Pro Pro Glu Asp Thr Ala Ser Thr Arg Ser Ser Phe Thr Val Gin Asp 
255 260 

Lys Pro Phe Thr Glu Tyr Val Phe Arg He Arg Cys Met Lys Glu 
270 275 

Asp Gly Lys Gly Tyr Trp Ser Asp Trp Ser Glu Glu Ala Ser Gly He 
285 

Thr Tyr Glu Asp Arg Pro Ser Lys Ala Pro Ser Phe Trp Tyr Lys He 

300 305 
Asp Pro Ser His Thr Gin Gly Tyr Arg Thr Val Gin Leu Val Trp Lys 
315 320 

Thr Leu Pro Pro Phe Glu Ala Asn Gly Lys He Leu Asp Ty= Glu Val 
335 340 

Thr Leu Thr Arg Trp Lys Ser His Leu Gin Asn Tyr Thr Val Asn Ala 
350 355 

Thr Lys Leu Thr Val Asn Leu Thr Asn Asp Arg Tyr Leu Ala Thr Leu 

365 370 
Thr val Arg Asn Leu Val Gly Lys Ser Asp Ala Ala Val Leu Thr He 

380 385 
Pro Ala Cys Asp Phe Gin Ala Thr His Pro Val Met Asp Leu Lys Ala 
395 

Phe Pro Lys Asp Asn Met Leu Trp Val Glu Trp Thr Thr Pro Arg Glu 
4X5 420 
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Ser Val Lys Lys Tyr lie Leu Glu Trp Cys Val Leu Ser Asp Lys Ala 
' 430 435 440 

Pro Cys lie Thr Asp Trp Gin Gin Glu Asp Gly Thr Val His Arg Thr 
445 450 455 

Tyr Leu Arg Gly Asn Leu Ala Glu Ser Lys Cys Tyr Leu lie Thr Val 
460 465 470 

Thr Pro Val Tyr Ala Asp Gly Pro Gly Ser Pro Glu Ser He Lys Ala 
475 480 485 490 

Tyr Leu Lys Gin Ala Pro Pro Ser Lys Gly Pro Thr Val Arg Thr Lys 
495 500 505 

Lys Val Gly Lys Asn Glu Ala Val Leu Glu Trp Asp Gin Leu Pro Val 
510 515 520 

Asp Val Gin Asn Gly Phe He Arg Asn Tyr Thr lie Phe Tyr Arg Thr 
525 530 535 

lie lie Gly Asn Glu Thr Ala Val Asn Val Asp Ser Ser His Thr Glu 
540 545 550 

Tyr Thr Leu Ser Ser Leu Thr Ser Asp Thr Leu Tyr Met Val Arg Met 
555 560 565 570 

Ala Ala Tyr Thr Asp Glu Gly Gly Lys Asp Gly Pro Glu Phe Thr Phe 
575 580 585 

Thr Thr Pro Lys Phe Ala Gin Gly Glu lie Glu Ala He Val Val Pro 
590 595 600 

Val Cys Leu Ala Phe Leu Leu Thr Thr Leu Leu Gly Val Leu Phe Cys 
605 610 615 

Phe Asn Lys Arg Asp Leu lie Lys Lys His lie Trp Pro Asn Val Pro 
620 625 630 

Asp Pro Ser Lys Ser His He Ala Gin Trp Ser Pro His Thr Pro Pro 
635 640 645 650 

Arg His Asn Phe Asn Ser Lys Asp Gin Met Tyr Ser Asp Gly Asn Phe 
655 660 665 

Thr Asp Val Ser Val Val Glu He Glu Ala Asn Asp Lys Lys Pro Phe 
670 675 680 

Pro Glu Asp Leu 
685 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNES S : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
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(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: hlgGlFc 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B> LOCATION: 1-.699 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAS CCC AGA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA 
iro £S sS Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
1 5 
CCT GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC 
Pro Su* Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 



20 



„ r r , c ACC CTC IIS ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG 
J£ tS 2u Me? lie Ser Arg Thr Pro Glu Val Thr Cys Val Val 



GTG GAC GTG AGC CAC GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG 
S? 2£ 5a? 5er His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val 

GAC GGC GTG GAG GTG CAT AAT GCC AAG ACA AAG CCG CGG GAG GAG CAG 
2p 25 val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 

65 70 
TAC AAC AGC ACG TAC CGG GTG GTC AGC GTC CTC ACC GTC CTG CAC CAG 
Tyr" Jsn Ser Thr Tyr Arg Val Val Ser Val Leu Thr val Leu His Gin 

GAC TGG CTG AAT GGC AAG GAC TAC AAG TGC AAG GTC. TCC AAC AAA GCC 
Sp £5 22 Jsn Gly Lys Asp Tyr Lys Cys Lys Val Ser Asn Lys Ala 



100 



CTC CCA GCC CCC ATG CAG AAA ACC ATC TCC AAA GCC AAA GGG CAG CCC 
2£ Pro" £a Pro Met Gin Lys Thr He Ser Lys Ala Lys Gly Gin Pro 
115 120 125 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG ACC 
S Su Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr 
130 135 

AAP AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGG 
i£ itn 2n vS Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Arg 
145 150 

CAC ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC 528 
Se S Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 
165 170 
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AAG ACC ACG CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC 57 6 

Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 
180 185 190 

AGC AAG CTC ACC GTG GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC 624 
Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe 
195 200 205 

TCA TGC TCC GTG ATG CAT GAG GCT CTG CAC AAC CAC TAC ACG CAG AAG 672 
Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
210 215 220 

AGC CTC TCC CTG TCT CCG GGT AAA TGAACTAGT 705 
Ser Leu Ser Leu Ser Pro Gly Lys 
225 230 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Glu Pro Arg Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
15 10 15 

Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 
20 ~ 25 30 

Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val 
35 40 45 

Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val 
50 55 60 

Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
65 ~ 70 75 80 

Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin 
85 90 95 

Asp Trp Leu Asn Gly Lys Asp Tyr Lys Cys Lys Val Ser Asn Lys Ala 
100 105 HO 

Leu Pro Ala Pro Met Gin Lys Thr He Ser Lys Ala Lys Gly Gin Pro 
115 120 125 

Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr 
130 135 140 

Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Arg 
145 150 155 160 

His He Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 
165 ~ 170 175 
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Lys Thx Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 
180 185 190 

Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe' 
195 200 205 

Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
210 215 220 

Ser Leu Ser Leu Ser Pro Gly Lys 
225 230 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3182 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

Ciii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(F> TISSUE TYPE: human placenta 

Cvii) IMMEDIATE SOURCE: 

(B) CLONE: pHLIFR-65 

(ix) FEATURE: 

(A) NAME /KEY: ioat_j>eptide 

(B) LOCATION: 311 -.3182 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 179.. 3182 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(B> LOCATION: 179.. 310 

(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 311.. 3182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGATCTTGGA ACGAGACGAC CTGCTCTCTC TCCCAGAACG TGTCTCTGCT GCAAGGCACC 
GGGCCCTTTC GCTCTGCAGA ACTGCACTTG CAAGACCATT ATCAACTCCT AATCCCAGCT 120 
CAGAAAGGGA GCCTCTGCGA CTCATTCATC GCCCTCCAGG ACT GACTGCA TTGCACAG 178 



60 
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ATG ATG GAT ATT TAG GTA TGT TTG AAA CGA CCA TCC TGG ATG GTG GAC 226 

Met Met Asp lie Tyr Val Cys Leu Lys Arg Pro Ser Trp Met Val Asp 
-44 -40 -35 -30 

AAT AAA AGA ATG AGG ACT GCT TCA AAT TTC CAG TGG CTG TTA TCA ACA 274 

Asn Lys Arg Met Arg Thr Ala Ser Asn Phe Gin Trp Leu Leu Ser Thr 
-25 ^ -20 -15 

TTT ATT CTT CTA TAT CTA ATG AAT CAA GTA AAT AGC CAG AAA AAG GGG 322 
Phe lie Leu Leu Tyr Leu Met Asn Gin Val Asn Ser Gin Lys Lys Gly 
-10 -5 1 

GCT CCT CAT GAT TTG AAG TGT GTA ACT AAC AAT TTG CAA GTG TGG AAC 370 
Ala Pro His Asp Leu Lys Cys Val Thr Asn Asn Leu Gin Val Trp Asn 
5 10 15 20 

TGT TCT TGG AAA GCA CCC TCT GGA ACA GGC CGT GGT ACT GAT TAT GAA 418 
Cys Ser Trp Lys Ala Pro Ser Gly Thr Gly Arg Gly Thr Asp Tyr Glu 
25 30 35 

GTT TGC ATT GAA AAC AGG TCC CGT TCT TGT TAT CAG TTG GAG AAA ACC 466 
Val Cys lie Glu Asn Arg Ser Arg Ser Cys Tyr Gin Leu Glu Lys Thr 
40 45 50 

AGT ATT AAA ATT CCA GCT CTT TCA CAT GGT GAT TAT GAA ATA ACA ATA 514 
Ser lie Lys lie Pro Ala Leu Ser His Gly Asp Tyr Glu lie Thr lie 
55 60 65 

AAT TCT CTA CAT GAT TTT GGA AGT TCT ACA AGT AAA TTC ACA CTA AAT 562 
Asn Ser Leu His Asp Phe Gly Ser Ser Thr Ser Lys Phe Thr Leu Asn 
70 75 80 

GAA CAA AAC GTT TCC TTA ATT CCA GAT ACT CCA GAG ATC TTG AAT TTG 610 
Glu Gin' Asn Val Ser Leu lie Pro Asp Thr Pro Glu He Leu Asn Leu 
85 90 95 100 

TCT GCT GAT TTC TCA ACC TCT ACA TTA TAC CTA "AAG TGG AAC GAC AGG 658 
Ser Ala Asp Phe Ser Thr Ser Thr Leu Tyr Leu Lys Trp Asn Asp Arg 
105 110 115 

GGT TCA GTT TTT CCA CAC CGC TCA AAT GTT ATC TGG GAA ATT AAA GTT 7 06 

Gly Ser Val Phe Pro His Arg Ser Asn Val He Trp Glu He Lys Val 
120 125 130 

CTA CGT AAA GAG AGT ATG GAG CTC GTA AAA TTA GTG ACC CAC AAC ACA 7 54 

Leu Arg Lys Glu Ser Met Glu Leu Val Lys Leu Val Thr His Asn Thr 
135 140 145 

ACT CTG AAT GGC AAA GAT ACA CTT CAT CAC TGG AGT TGG GCC TCA GAT 802 
Thr Leu Asn Gly Lys Asp Thr Leu His His Trp Ser Trp Ala Ser Asp 
150 ~ ~ 155 160 

ATG CCC TTG GAA TGT GCC ATT CAT TTT GTG GAA ATT AGA TGC TAC ATT 850 
Met Pro Leu Glu Cys Ala He His Phe Val Glu He Arg Cys Tyr He 
165 170 175 180 

GAC AAT CTT CAT TTT TCT GGT CTC GAA GAG TGG AGT GAC TGG AGC CCT 898 
Asp Asn Leu His Phe Ser Gly Leu Glu Glu Trp Ser Asp Trp Ser Pro 
185 190 195 
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GTT AAA CTT TCT TGG CAT TTA CCA GGC AAC TTT GCA AAG ATT AAT TTT 
SS W Ser Txp His Leu Pro Gly Asn Phe Ala Lys He Asn Phe 

405 410 
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CTG AAG AAC ATT TCT TGG ATA CCT GAT TCT CAG ACT AAG GTT TTT CCT 946 
52 £S He Ser Trp He Pro Asp Ser Gin Thr Lys Val Phe Pro 

200 205 210 

CAA GAT AAA GIG ATA CTT GTA GGC TCA GAC ATA ACA TTT TGT TGI GTG 994 
S£ S£ Lys Val He Leu Val Gly Ser Asp He Thr Phe Cys Cys Val 
215 220 225 

AGT CAA GAA AAA GTG TTA TCA GCA CTG ATT GGC CAT ACA AAC TGC CCC 1042 
Ser £u Lys Val Leu Ser Ala Leu He Gly His Thr Asn Cys Pro 

TTG ATC CAT CTT GAT GG6 GAA AAT GTT GCA ATC AAG ATT CGT AAT ATT 1090 
Su Se Ss Leu Asp Gly Glu Asn Val Ala lie Lys He Arg Asn He 
245 250 255 

TCT GTT TCT GCA AGT AGT GGA ACA AAT GTA GTT TTT ACA ACC GAA GAT 1138 
Sr vS Ser Ala Ser Ser Gly Thr Asn Val Val Phe Thr Thr Glu Asp 
255 270 275 

AA C ATA TTT GGA ACC GTT ATT TTT GCT GGA TAT CCA CCA GAT ACT CCT 1186 
S Phe Gly Thr Val He Phe Ala Gly Tyr Pro Pro Asp Thr Pro 
280 285 

CAA CAA CTG AAT TGT GAG ACA CAT GAT TTA AAA GAA ATT ATA TGT AGT 1234 
3S S£ Su Asn Cys Glu Thr His Asp Leu Lys Glu He He Cys Ser 
295 300 305 

TGG AAT CCA GGA AGG GTG ACA GCG TTG GTG GGC CCA CGT GCT ACA AGC 1282 
S Sn Pro Gly Arg- Val Thr Ala Leu Val Gly Pro Arg Ala Thr Ser 
310 315 320 

TAC ACT TTA GTT GAA AGT TTT TCA GGA AAA TAT GTT AGA CTT AAA AGA 1330 
Sr t2 Leu Val Glu Ser Phe Ser Gly Lys Tyr Val Arg Leu Lys Arg 
325 330 

CCT GAA GCA CCT ACA AAC GAA AGC TAT CAA TTA TTA TTT CAA ATG CTT 1378 
Ala Glu Ala Pro iS Asn Glu Ser Tyr Gin Leu Leu Phe Gin Met Leu 
345 350 35 

CCA AAT CAA GAA ATA TAT AAT TTT ACT TTG AAT GCT CAC AAT CCG CTG 1426 
Pro Asn Gin Glu He Tyr Asn Phe Thr Leu Asn Ala His Asn Pro Leu 
360 365 370 

GGT CGA TCA CAA TCA ACA ATT TTA GTT AAT ATA ACT GAA AAA GTT TAT 1474 
Sv S Ser Gin Ser Thr He Leu Val Asn He Thr Glu Lys Val Tyr 
375 580 385 

CCC CAT ACT CCT ACT TCA TTC AAA GTG AAG GAT ATT AAT TCA ACA GCT 1522 
So Ss tS So tS Ser Phe Lys Val Lys Asp He Asn Ser Thr Ala 

395 400 



1570 



TTA TGT GAA ATT GAA ATT AAG AAA TCT AAT TCA GTA CAA GAG CAG CGG 1618 
Glu 
425 



TTts TGT GAA ATT GAA ATT AAG AAA ivr iuii ~ ~ — 

S Su lie Glu lie Lys Lys Ser Asn Ser Val Gin Glu Gin Arg 
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AAT GTC ACA ATC AAA GGA GTA GAA AAT TCA AGT TAT CTT GTT GCT CTG 1666 

Asn Val Thr lie Lys Gly Val Glu Asn Ser Ser Tyr Leu Val Ala Leu 
440 445 450 

GAC AAG TTA AAT CCA TAC ACT CTA TAT ACT TTT CGG ATT CGT TGT TCT 1714 
Asp Lys Leu Asn Pro Tyr Thr Leu Tyr Thr Phe Arg He Arg Cys Ser 
455 460 465 

ACT GAA ACT TTC TGG AAA TGG AGC AAA TGG AGC AAT AAA AAA CAA CAT 1762 
Thr Glu Thr Phe Trp Lys Trp Ser Lys Trp Ser Asn Lys Lys Gin His 
470 475 480 

TTA ACA ACA GAA GCC AGT CCT TCA AAG GGG CCT GAT ACT TGG AGA GAG 1810 
Leu Thr Thr Glu Ala Ser Pro Ser Lys Gly Pro Asp Thr Trp Arg Glu 
485 490 495 500 

TGG AGT TCT GAT GGA AAA AAT TTA ATA ATC TAT TGG AAG CCT TTA CCC 1858 
Trp Ser Ser Asp Gly Lys Asn Leu He He Tyr Trp Lys Pro Leu Pro 
505 510 515 

ATT AAT GAA GCT AAT GGA AAA ATA CTT TCC TAC AAT GTA TCG TGT TCA 1906 
He Asn Glu Ala Asn Gly Lys He Leu Ser Tyr Asn Val Ser Cys Ser 
520 525 530 

TCA GAT GAG GAA ACA CAG TCC CTT TCT GAA ATC CCT GAT CCT CAG CAC 1954 
Ser Asp Glu Glu Thr Gin Ser Leu Ser Glu He Pro Asp Pro Gin His 
535 540 545 

AAA GCA GAG ATA CGA CTT GAT AAG AAT GAC TAC ATC ATC AGC GTA GTG 2002 
Lys Ala Glu He Arg Leu Asp Lys Asn Asp Tyr He He Ser Val val 
550 555 560 

GCT AAA AAT TCT GTG GGC TCA TCA CCA CCT TCC AAA ATA GCG AGT ATG 2050 
Ala Lys Asn Ser Val Gly Ser Ser Pro Pro Ser Lys He Ala Ser Met 
56S 570 575 580 

GAA ATT CCA AAT GAT GAT CTC AAA ATA GAA CAA GTT GTT GGG ATG GGA 2 098 

Glu He Pro Asn Asp Asp Leu Lys He Glu Gin Val Val Gly Met Gly 
585 590 595 

AAG GGG ATT CTC CTC ACC TGG CAT TAC GAC CCC AAC ATG ACT TGC GAC 214 6 

Lys Gly He Leu Leu Thr Trp His Tyr Asp Pro Asn Met Thr Cys Asp 
600 605 610 

TAC GTC ATT AAG TGG TGT AAC TCG TCT CGG TCG GAA CCA TGC CTT ATG 2194 
Tyr Val He Lys Trp Cys Asn Ser Ser Arg Ser Glu Pro Cys Leu Met 
615 620 625 

GAC TGG AGA AAA GTT CCC TCA AAC AGC ACT GAA ACT GTA ATA GAA TCT 2242 
Asp Trp Arg Lys Val Pro Ser Asn Ser Thr Glu Thr Val He Glu Ser 
630 635 640 

GAT GAG TTT CGA CCA GGT ATA AGA TAT AAT TTT TTC CTG TAT GGA TGC 2290 
Asp Glu Phe Arg Pro Gly He Arg Tyr Asn Phe Phe Leu Tyr Gly Cys 
645 " 650 655 660 

AGA AAT CAA GGA TAT CAA TTA TTA CGC TCC ATG ATT GGA TAT ATA GAA 2338 
Arg Asn Gin Gly Tyr Gin Leu Leu Arg Ser Met He Gly Tyr He Glu 
t 665 670 675 
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GAA TTG GCT CCC ATT GTT GCA CCA. AAT TTT ACT GTT GAG GAT ACT TCT 2386 
Glu Leu Ala Pro He Val Ala Pro Asn Phe Thr Val Glu Asp Thr Ser 
680 685 690 

GCA GAT TCG ATA TTA GTA AAA TGG GAA GAC ATT CCT GTG GAA GAA CTT 2434 
Ala Asp Ser lie Leu Val Lys Trp Glu Asp He Pro Val Glu Glu Leu 
595 700 705 

AG* GGC TTT TTA AGA GGA TAT TTG TTT TAC TTT GGA AAA GGA GAA AGA 2482 
25 Phe Leu Arg Gly Tyr Leu Phe Tyr Phe Gly Lys Gly Glu Arg 
710 715 720 

GAC ACA TCT AAG ATG AGG GTT TTA GAA TCA GGT CGT TCT GAC ATA AAA 2530 
Asp Thr Ser Lys Met Arg Val Leu Glu Ser Gly Arg Ser Asp He Lys 
725 7 30 735 740 

GTT AAG AAT ATT ACT GAC ATA TCC CAG AAG ACA CTG AGA ATT GCT GAT 2578 
Val Lys Asn He Thr Asp He Ser Gin Lys Thr Leu Arg He Ala Asp 
745 750 '55 

CTT CAA GGT AAA ACA AGT TAC CAC CTG GTC TTG CGA GCC TAT ACA GAT 2626 
£eu Gin Gly Lys Thr Ser Tyr His Leu Val Leu Arg Ala Tyr Thr Asp 
760 76 5 770 

GGT GGA GTG GGC CCG GAG AAG AGT ATG TAT GTG GTG ACA AAG GAA AAT 2674 
Gly Gly Val Gly Pro Glu Lys Ser Met Tyr Val Val Thr Lys Glu Asn 
X 775 7 80 7 85 

TCT GTG GGA TTA ATT ATT GCC ATT CTC ATC CCA GTG GCA GTG GCT GTC 2722 
Ser Val Gly Leu He He Ala He Leu He Pro Val Ala Val Ala Val 
790 795 800 

ATT GTT GGA GTG GTG ACA AGT ATC CTT TGC TAT CGG AAA CGA GAA TGG 2770 
lie Val Gly Val Val Thr Ser He Leu Cys Tyr Arg Lys Arg Glu Trp 
805 810 815 820 

ATT AAA GAA ACC TTC TAC CCT GAT ATT CCA AAT CCA GAA AAC TGT AAA 2818 
lie Lys Glu .Thr Phe Tyr Pro Asp He Pro Asn Pro Glu Asn Cys Lys 
825 830 835 

GCA TTA CAG TTT CAA AAG AGT GTC TGT GAG GGA AGC AGT GCT CTT AAA 2866 
Ala Leu Gin Phe Gin Lys Ser Val Cys Glu Gly Ser Ser Ala Leu Lys 
840 845 850 

ACA TTG GAA ATG AAT CCT TGT ACC CCA AAT AAT GTT GAG GTT CTG GAA 2914 
Thr Leu Glu Met Asn Pro Cys Thr Pro Asn Asn Val Glu Val Leu Glu 
855 860 865 

ACT CGA TCA GCA TTT CCT AAA ATA GAA GAT ACA GAA ATA ATT TCC CCA 2962 
Thr Arg Ser Ala Phe Pro Lys He Glu Asp Thr Glu He He Ser Pro 
870 875 880 

GTA GCT GAG CGT CCT GAA GAT CGC TCT GAT GCA GAG CCT GAA AAC CAT 3010 
Val Ala Glu Arg Pro Glu Asp Arg Ser Asp Ala Glu Pro Glu Asn His 
885 ~ 890 895 900 

GTG GTT GTG TCC TAT TGT CCA CCC ATC ATT GAG GAA GAA ATA CCA AAC 3058 
Val Val Val Ser Tyr Cys Pro Pro He He Glu Glu Glu He Pro Asn 
905 910 915 
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CCA GCC GCA GAT GAA GCT GGA GGG ACT GCA CAG GTT ATT TAC ATT GAT 3106 

Pro Ala Ala Asp Glu Ala Gly Gly Thr Ala Gin Val lie Tyr lie Asp 
920 925 930 

GTT CAG TCG ATG TAT CAG CCT CAA GCA AAA CCA GAA GAA AAA AAA AAA 3154 
Val Gin Ser Met Tyr Gin Pro Gin Ala Lys Pro Glu Glu Lys Lys Lys 
935 940 945 

AAA AGC AGG TCG TCT CGT TCC AAG ATC T 3182 
Lys Ser Arg Ser Ser Arg Ser Lys lie 
950 955 

(2> INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1001 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Met Asp lie Tyr Val Cys Leu Lys Arg Pro Ser Trp Met Val Asp 
-44 -40 -35 -30 

Asn Lys Arg Met Arg Thr Ala Ser Asn Phe Gin Trp Leu Leu Ser Thr 
-25 -20 -15 

Phe lie Leu Leu Tyr Leu Met Asn Gin Val Asn Ser Gin Lys Lys Gly 
-10 -5 1 

Ala Pro His Asp Leu Lys Cys Val Thr Asn Asn Leu Gin Val Trp Asn 
5 10 15 20 

Cys Ser Trp Lys Ala Pro Ser Gly Thr Gly Arg Gly Thr Asp Tyr Glu 
25 30 35 

Val Cys lie Glu Asn Arg Ser Arg Ser Cys Tyr Gin Leu Glu Lys Thr 
40 45 50 

Ser lie Lys lie Pro Ala Leu Ser His Gly Asp Tyr Glu He Thr He 
55 60 65 

Asn Ser Leu His Asp Phe Gly Ser Ser Thr Ser Lys Phe Thr Leu Asn 
70 75 80 

Glu Gin Asn Val Ser Leu He Pro Asp Thr Pro Glu He Leu Asn Leu 
85 90 95 100 

Ser Ala Asp Phe Ser Thr Ser Thr Leu Tyr Leu Lys Trp Asn Asp Arg 
105 HO 115 

Gly Ser Val Phe Pro His Arg Ser Asn Val He Trp Glu He Lys Val 
120 125 130 

Leu Arg Lys Glu Ser Met Glu Leu Val Lys Leu Val Thr His Asn Thr 
135 140 145 
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Thr Leu Asn Gly Lys Asp Thr Leu His His Trp Ser Trp Ala Ser Asp 

Met Pro Leu Glu Cys Ala He His Phe Val Glu He Arg Cys Tyr lie ' 
165 170 175 

Asp Asn Leu His Phe Ser Gly Leu Glu Glu Trp Ser Asp Trp Ser Pro 

Val Lys Asn He Ser Trp He Pro Asp Ser Gin Thr Lys Val Phe Pro 
200 2Q5 

Gin Asp Lys Val He Leu Val Gly Ser Asp He Thr Phe Cys Cys Val 

Ser Gin Glu Lys Val Leu Ser Ala Leu He Gly His Thr Asn Cys Pro 

230 235 
leu He His Leu Asp Gly Glu Asn Val Ala He Lys He Arg Asn He 
245 250 255 

Ser Val Ser Ala Ser Ser Gly Thr Asn Val Val Phe Thr Thr Glu Asp 
265 270 

He Phe Ala Gly Tyr Pro Pro Asp Thr Pro 

280 



Asn He Phe Gly Thr Val He Phe Ala Gly Tyr Pro Pro „ 



Gin Gin Leu Asn Cys Glu Thr His Asp Leu Lys Glu lie He Cys Ser 



295 

Val Gly Pro Arg Ala Thr Ser 

310 



Trp Asn Pro Gly Arg Val Thr Ala Leu ^ 



Tyr Thr Leu Val Glu Ser Phe Ser Gly Lys Tyr Val Arg Leu Lys Arg 
325 330 335 

Ala Glu Ala Pro Thr Asn Glu Ser Tyr Gin Leu Leu Phe Gin Met Leu 
345 350 

Pro Asn Gin Glu He Tyr Asn Phe Thr Leu Asn Ala His Asn Pro Leu 

360 365 
Gly Arg Ser Gin Ser Thr He Leu Val Asn He Thr Glu Lys Val Tyr 
375 

Pro His Thr Pro Thr Ser Phe Lys Val Lys Asp lie Asn Ser Thr Ala 

390 395 
Val Lys Leu Ser Trp His Leu Pro Gly Asn Phe Ala Lys He Asn Phe 
405 410 

leu Cys Glu He Glu He Lys Lys Ser Asn Ser Val Gin Glu Gin Arg 
425 430 



Asn Val Thr He Lys Gly Val Glu Asn Ser Ser Tyr Leu Val Ala Leu 
440 445 



450 



Asp Lys Leu Asn Pro Tyr Thr Leu Tyr Thr Phe Arg He Arg Cys Ser 
W 455 4 60 465 
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Thr Glu Thr Phe Trp Lys Trp Ser Lys Trp Ser Asn Lys Lys Gin His 
470 * 475 480 

Leu Thr Thr Glu Ala Ser Pro Ser Lys Gly Pro Asp Thr Trp Arg Glu 
485 490 495 500 

Trp Ser Ser Asp Gly Lys Asn Leu lie He Tyr Trp Lys Pro Leu Pro 
505 510 515 

He Asn Glu Ala Asn Gly Lys He Leu Ser Tyr Asn Val Ser Cys Ser 
520 525 530 

Ser Asp Glu Glu Thr Gin Ser Leu Ser Glu lie Pro Asp Pro Gin His 
535 540 545 

Lys Ala Glu lie Arg Leu Asp Lys Asn Asp Tyr He He Ser Val Val 
550 555 560 

Ala Lys Asn Ser Val Gly Ser Ser Pro Pro Ser Lys He Ala Ser Met 
565 570 575 580 

Glu He Pro Asn Asp Asp Leu Lys He Glu Gin Val Val Gly Met Gly 
585 590 595 

Lys Gly He Leu Leu Thr Trp His Tyr Asp Pro Asn Met Thr Cys Asp 
600 605 610 

Tyr Val He Lys Trp Cys Asn Ser Ser Arg Ser Glu Pro Cys Leu Met 
615 620 625 

Asp Trp Arg Lys Val Pro Ser Asn Ser Thr Glu Thr Val He Glu Ser 
630 635 64 0 

Asp Glu Phe Arg Pro Gly He Arg Tyr Asn Phe Phe Leu Tyr Gly Cys 
645 650 655 660 

Arg Asn Gin Gly Tyr Gin Leu Leu Arg Ser Met He Gly Tyr He Glu 
665 670 675 

Glu Leu Ala Pro He Val Ala Pro Asn Phe Thr Val Glu Asp Thr Ser 
680 685 690 

Ala Asp Ser He Leu Val Lys Trp Glu Asp He Pro Val Glu Glu Leu 
695 700 705 

Arg Gly Phe Leu Arg Gly Tyr Leu Phe Tyr Phe Gly Lys Gly Glu Arg 
710 715 720 

Asp Thr Ser Lys Met Arg Val Leu Glu Ser Gly Arg Ser Asp He Lys 
725 ~ 730 735 740 

Val Lys Asn He Thr Asp He Ser Gin Lys Thr Leu Arg He Ala Asp 
745 750 755 

Leu Gin Gly Lys Thr Ser Tyr His Leu Val Leu Arg Ala Tyr Thr Asp 
760 765 770 

Gly Gly Val Gly Pro Glu Lys Ser Met Tyr Val Val Thr Lys Glu Asn 
775 ~ 780 785 
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Ser Val Gly I« Xle He Ala He Leu He Pro Val Ala Val Ala Val 

790 795 
He Val Gly Val Val Thr Ser He Leu Cys Tyr Arg Lys Arg Glu Trp 
805 810 

He Lys Glu Thr Phe Tyr Pro Asp He Pro Asa Pro Glu Asn Cys Lys 



825 



Ala Leu Gin Phe Gin Lys Ser Val Cys Glu Gly Ser Ser Ala Leu Lys 
840 845 0 



Thr Leu Glu Met Asn Pro Cys Thr Pro Asn Asn Val Glu Val Leu Glu 
855 860 ot>D 



Thr Arg Ser Ala Phe Pro Lys He Glu Asp Thr Glu He He Ser Pro 

870 875 880 

Val Ala Glu Arg Pro Glu Asp Arg Ser Asp Ala Glu Pro Glu Asn His 
885 890 

Val Val Val Ser Tyr Cys Pro Pro He He Glu Glu Glu He Pro Asn 

Pro Ala Ala Asp Glu Ala Gly Gly Thr Ala Gin Val He Tyr He Asp 
920 925 

Val Gin Ser Met Tyr Gin Pro Gin Ala Lys Pro Glu Glu Lys Lys Lys 
935 940 9 

Lys Ser Arg Ser Ser Arg Ser Lys He 
950 955 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GATCCGGTGG AGGTGGTTCT GGTGGAGGTG GTTCAGGTGG TGGAGGATCA GGAGGTGGTG 60 
GATCAGGTGG AGGAGGTTCT GGAGGTGGAG GTTCCGGAAT 100 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A* LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) . SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GATATGTCGA CGATGATGGA TATTTACGTA TGTTTG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCATGGATCC ACCTCCTCCA GAATTTTCCT TTGTCACCAC ATACATAC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGCGTCCGGA GGAGGTGGAT CTGAACTTCT AGATCCATGT GGTTATATC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
GCATGCGGCC GCCTATTCAA TTTCTCCTTG AGCAAAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GATATGTCGA CAAGATGTTG ACGTTGCAGA CTTGG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GCATGGATCC ACCTCCTCCT TCAATTTCTC CTTGAGCAAA C 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGCGTCCGGA GGAGGTGGTA GCCAGAAAAA GGGGGCTCCT CATG 
(2) INFORMATION FOR SEQ ID NO: 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCATGCGGCC GCTAAGAATT TTCCTTTGTC ACCACATACA TAC 
(2) INFORMATION FOR SEQ ID NO:lG: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCATAGATCT GGGCTCAGAA TTTTCCTTTG TCACCACATA CATAC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCATAGATCT GGGCTCTTCA ATTTCTCCTT GAGCAAAC 
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CLAIMS 



What is claimed is: 



1. A receptor capable of binding oncostatin M and leukemia inhibitory 
factor, comprising gpl30 covalently linked to UF-R. 

2. A receptor according to claim 1, wherein said receptor comprises a 
10 soluble gpl30 polypeptide covalendy linked to a soluble UF-R polypeptide. 

3 . A receptor according to claim 1 wherein said receptor comprises gpl30 
covalendy linked to UF-R via a polypeptide linker. 

15 4. a receptor according to claim 3, wherein said receptor is a recombinant 

fusion protein of the formula: 

R1-L-R2CTR2-L-R1 

20 wherein Ri represents gpl30; R 2 represents LIF-R; and L represents a polypeptide 
linker. 

5. A receptor according to claim 4 wherein the polypeptide linker 
comprises from 20 to 100 amino acids selected from the group consisting of glycine, 

25 asparagine, serine, threonine, and alanine. 

6. A receptor according to claim 5 wherein the polypeptide linker 
comprises an amino acid sequence selected from the group consisting of : 
(Gly 4 -Ser-Gly5-Ser)2 and (Gly4-Ser)n, wherein n is 4-12. 



30 



35 



7. An isolatedDNA sequence encoding the receptor of claim 4. 

8. A recombinant expression vector comprising the DNA sequence of 
claim 7. 

9. A host cell containing the expression vector of claim 8. 
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10. A receptor according to claim 3, comprising a first fusion polypeptide 
that comprises an antibody Fc region polypeptide attached to the C-terminus of gpl30, 
and a second fusion polypeptide that comprises an antibody Fc region polypeptide 
attached to the C-terminus of LIF-R, wherein said first fusion polypeptide is linked to 

5 said second fusion polypeptide via disulfide bonds between the Fc region polypeptides. 

11. A receptor according to claim 1, 4, or 10, wherein: 

a) said gpl30 is encoded by an isolated DNA selected from the group 
10 consisting of a first DNA sequence comprising nucleotides 244-2369 of SEQ ED NO: 1 , 
a second DNA sequence comprising nucleotides 310-2369 of SEQ ID NO:l, and a 
thiid DNA sequence that will hybridize to said second DNA sequence under moderately 
stringent conditions; and 

15 b) said LIF-R is encoded by an isolated DNA selected from the group 

consisting of a first DNA sequence comprising nucleotides 179-3182 of SEQ ID NO:5, 
a second DNA sequence comprising nucleotides 311-3182 of SEQ ID NO:5, and a 
third DNA sequence that will hybridize to said second DNA sequence under moderately 
stringent conditions. 

20 

12. A receptor according to claim 10 wherein said gpl30 is a soluble gp 130 
polypeptide and said LIF-R is a soluble LIF-R polypeptide. 

13. A fusion protein comprising an antibody Fc region polypeptide attached 
25 to the C-terminus of a soluble gp 1 30 polypeptide. 

14. An isolated DNA sequence encoding a fusion protein according to claim 

13. 

30 15. A fusion protein comprising an antibody Fc region polypeptide attached 

to the C-terminus of a soluble LIF-R polypeptide. 

16. An isolated DNA sequence encoding a fusion protein according to claim 

15. 

35 

17. A homodimeric receptor comprising two fusion proteins according to 
claim 15, linked via disulfide bonds between the Fc region polypeptides. 
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18. A process for preparing a receptor according to claim 4, comprising 
culturing a host cell transformed with an expression vector comprising a DNA sequence 
that encodes said fusion protein under conditions that promote expression of said 

5 fusion protein, and recovering said fusion protein. 

19. A process for preparing a receptor according to claim 10, comprising 
culturing a host cell co-transfected with a first expression vector encoding said first 
fusion polypeptide and with a second expression vector encoding said second fusion 

10 polypeptide under conditions that promote expression of said first and second fusion 
polypeptides, and recovering said receptor. 

20. A pharmaceutical composition for treating a disorder mediated by 
oncostatin M or LEF, comprising the receptor of claim 1, 4, 10, or 12, and a suitable 

15 diluent or carrier. 
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FIGURE 4 
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